Preface



I am frequently asked questions such as 'What are fractals?', 'What is fractal 
dimension?', 'How can one find the dimension of a fractal and what does it 
tell us anyway?' or 'How can mathematics be applied to fractals?' This book 
endeavours to answer some of these questions. 

The main aim of the book is to provide a treatment of the mathematics asso- 
ciated with fractals and dimensions at a level which is reasonably accessible to 
those who encounter fractals in mathematics or science. Although basically a 
mathematics book, it attempts to provide an intuitive as well as a mathematical 
insight into the subject. 

The book falls naturally into two parts. Part I is concerned with the general 
theory of fractals and their geometry. Firstly, various notions of dimension and 
methods for their calculation are introduced. Then geometrical properties of frac- 
tals are investigated in much the same way as one might study the geometry of 
classical figures such as circles or ellipses: locally a circle may be approximated 
by a line segment, the projection or 'shadow' of a circle is generally an ellipse, 
a circle typically intersects a straight line segment in two points (if at all), and 
so on. There are fractal analogues of such properties, usually with dimension 
playing a key role. Thus we consider, for example, the local form of fractals, 
and projections and intersections of fractals. 

Part II of the book contains examples of fractals, to which the theory of the 
first part may be applied, drawn from a wide variety of areas of mathematics 
and physics. Topics include self-similar and self-affine sets, graphs of functions, 
examples from number theory and pure mathematics, dynamical systems, Julia 
sets, random fractals and some physical applications. 

There are many diagrams in the text and frequent illustrative examples. Com- 
puter drawings of a variety of fractals are included, and it is hoped that enough 
information is provided to enable readers with a knowledge of programming to 
produce further drawings for themselves. 

It is hoped that the book will be a useful reference for researchers, providing 
an accessible development of the mathematics underlying fractals and showing 
how it may be applied in particular cases. The book covers a wide variety of 
mathematical ideas that may be related to fractals, and, particularly in Part II, 
provides a flavour of what is available rather than exploring any one subject
in too much detail. The selection of topics is to some extent at the author's 
whim — there are certainly some important applications that are not included. 
Some of the material dates back to early in the twentieth century whilst some is 
very recent. 

Notes and references are provided at the end of each chapter. The references 
are by no means exhaustive, indeed complete references on the variety of topics 
covered would fill a large volume. However, it is hoped that enough information 
is included to enable those who wish to do so to pursue any topic further. 

It would be possible to use the book as a basis for a course on the mathe- 
matics of fractals, at postgraduate or, perhaps, final-year undergraduate level, and 
exercises are included at the end of each chapter to facilitate this. Harder sections 
and proofs are marked with an asterisk, and may be omitted without interrupting 
the development. 

An effort has been made to keep the mathematics to a level that can be under- 
stood by a mathematics or physics graduate, and, for the most part, by a diligent 
final-year undergraduate. In particular, measure theoretic ideas have been kept to 
a minimum, and the reader is encouraged to think of measures as 'mass distribu- 
tions' on sets. Provided that it is accepted that measures with certain (intuitively 
almost obvious) properties exist, there is little need for technical measure theory 
in our development. 

Results are always stated precisely to avoid the confusion which would other- 
wise result. Our approach is generally rigorous, but some of the harder or more 
technical proofs are either just sketched or omitted altogether. (However, a few 
harder proofs that are not available in that form elsewhere have been included, in 
particular those on sets with large intersection and on random fractals.) Suitable 
diagrams can be a help in understanding the proofs, many of which are of a 
geometric nature. Some diagrams are included in the book; the reader may find 
it helpful to draw others. 

Chapter 1 begins with a rapid survey of some basic mathematical concepts 
and notation, for example, from the theory of sets and functions, that are used 
throughout the book. It also includes an introductory section on measure theory 
and mass distributions which, it is hoped, will be found adequate. The section 
on probability theory may be helpful for the chapters on random fractals and 
Brownian motion. 

With the wide variety of topics covered it is impossible to be entirely consistent 
in use of notation and inevitably there sometimes has to be a compromise between 
consistency within the book and standard usage. 

In the last few years fractals have become enormously popular as an art form, 
with the advent of computer graphics, and as a model of a wide variety of physical 
phenomena. Whilst it is possible in some ways to appreciate fractals with little or 
no knowledge of their mathematics, an understanding of the mathematics that can 
be applied to such a diversity of objects certainly enhances one's appreciation. 
The phrase 'the beauty of fractals' is often heard — it is the author's belief that 
much of their beauty is to be found in their mathematics. 

It is a pleasure to acknowledge those who have assisted in the preparation
of this book. Philip Drazin and Geoffrey Grimmett provided helpful comments 
on parts of the manuscript. Peter Shiarly gave valuable help with the computer 
drawings and Aidan Foss produced some diagrams. I am indebted to Charlotte 
Farmer, Jackie Cowling and Stuart Gale of John Wiley and Sons for overseeing 
the production of the book. 

Special thanks are due to David Marsh — not only did he make many useful 
comments on the manuscript and produce many of the computer pictures, but he 
also typed the entire manuscript in a most expert way. 

Finally, I would like to thank my wife Isobel for her support and encourage- 
ment, which extended to reading various drafts of the book. 



Kenneth J. Falconer 

Bristol, April 1989 



Preface to the second edition 



It is thirteen years since Fractal Geometry — Mathematical Foundations and Appli- 
cations was first published. In the meantime, the mathematics and applications of 
fractals have advanced enormously, with an ever-widening interest in the subject 
at all levels. The book was originally written for those working in mathematics 
and science who wished to know more about fractal mathematics. Over the past 
few years, with changing interests and approaches to mathematics teaching, many 
universities have introduced undergraduate and postgraduate courses on fractal 
geometry, and a considerable number have been based on parts of this book. 

Thus, this new edition has two main aims. First, it indicates some recent devel- 
opments in the subject, with updated notes and suggestions for further reading. 
Secondly, more attention is given to the needs of students using the book as a 
course text, with extra details to help understanding, along with the inclusion of 
further exercises. 

Parts of the book have been rewritten. In particular, multifractal theory has 
advanced considerably since the first edition was published, so the chapter on 
'Multifractal Measures' has been completely rewritten. The notes and references 
have been updated. Numerous minor changes, corrections and additions have 
been incorporated, and some of the notation and terminology has been changed to 
conform with what has become standard usage. Many of the diagrams have been 
replaced to take advantage of the more sophisticated computer technology now 
available. Where possible, the numbering of sections, equations and figures has 
been left as in the first edition, so that earlier references to the book remain valid. 

Further exercises have been added at the end of the chapters. Solutions to these 
exercises and additional supplementary material may be found on the world wide 
web at 

http://www.wileyeurope.com/fractal

In 1997 a sequel, Techniques in Fractal Geometry, was published, presenting 
a variety of techniques and ideas current in fractal research. Readers wishing 
to study fractal mathematics beyond the bounds of this book may find the 
sequel helpful. 

I am most grateful to all who have made constructive suggestions on the text. In 
particular I am indebted to Carmen Fernandez, Gwyneth Stallard and Alex Cain 
for help with this revision. I am also very grateful for the continuing support
given to the book by the staff of John Wiley & Sons, and in particular to Rob 
Calver and Lucy Bryan, for overseeing the production of this second edition and 
John O'Connor and Louise Page for the cover design. 



Kenneth J. Falconer 

St Andrews, January 2003 



Course suggestions 



There is far too much material in this book for a standard length course on 
fractal geometry. Depending on the emphasis required, appropriate sections may 
be selected as a basis for an undergraduate or postgraduate course. 

A course for mathematics students could be based on the following sections. 

(a) Mathematical background 

1.1 Basic set theory; 1.2 Functions and limits; 1.3 Measures and mass 
distributions. 

(b) Box-counting dimensions 

3.1 Box-counting dimensions; 3.2 Properties of box-counting dimensions. 

(c) Hausdorff measures and dimension 

2.1 Hausdorff measure; 2.2 Hausdorff dimension; 2.3 Calculation of Haus- 
dorff dimension; 4.1 Basic methods of calculating dimensions. 

(d) Iterated function systems 

9.1 Iterated function systems; 9.2 Dimensions of self-similar sets; 9.3 Some 
variations; 10.2 Continued fraction examples. 

(e) Graphs of functions 

11.1 Dimensions of graphs, the Weierstrass function and self-affine graphs. 

(f) Dynamical systems 

13.1 Repellers and iterated function systems; 13.2 The logistic map. 

(g) Iteration of complex functions 

14.1 Sketch of general theory of Julia sets; 14.2 The Mandelbrot set; 14.3 
Julia sets of quadratic functions. 



Introduction



In the past, mathematics has been concerned largely with sets and functions to
which the methods of classical calculus can be applied. Sets or functions that 
are not sufficiently smooth or regular have tended to be ignored as 'pathological' 
and not worthy of study. Certainly, they were regarded as individual curiosities 
and only rarely were thought of as a class to which a general theory might be 
applicable. 

In recent years this attitude has changed. It has been realized that a great deal 
can be said, and is worth saying, about the mathematics of non-smooth objects. 
Moreover, irregular sets provide a much better representation of many natural 
phenomena than do the figures of classical geometry. Fractal geometry provides 
a general framework for the study of such irregular sets. 

We begin by looking briefly at a number of simple examples of fractals, and 
note some of their features. 

The middle third Cantor set is one of the best known and most easily constructed
fractals; nevertheless it displays many typical fractal characteristics. It
is constructed from a unit interval by a sequence of deletion operations; see
figure 0.1. Let $E_0$ be the interval [0, 1]. (Recall that [a, b] denotes the set of real
numbers x such that $a \leq x \leq b$.) Let $E_1$ be the set obtained by deleting the
middle third of $E_0$, so that $E_1$ consists of the two intervals $[0, \frac{1}{3}]$ and
$[\frac{2}{3}, 1]$. Deleting the middle thirds of these intervals gives $E_2$; thus $E_2$
comprises the four intervals $[0, \frac{1}{9}]$, $[\frac{2}{9}, \frac{1}{3}]$,
$[\frac{2}{3}, \frac{7}{9}]$, $[\frac{8}{9}, 1]$. We continue in this way, with $E_k$ obtained by
deleting the middle third of each interval in $E_{k-1}$. Thus $E_k$ consists of $2^k$ intervals
each of length $3^{-k}$. The middle third Cantor set F consists of the numbers that
are in $E_k$ for all k; mathematically, F is the intersection $\bigcap_{k=0}^{\infty} E_k$. The Cantor
set F may be thought of as the limit of the sequence of sets $E_k$ as k tends to
infinity. It is obviously impossible to draw the set F itself, with its infinitesimal
detail, so 'pictures of F' tend to be pictures of one of the $E_k$, which are a good
approximation to F when k is reasonably large; see figure 0.1.
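
The deletion procedure can be sketched in a few lines of Python. This is only an illustration, not part of the original text: the function names `cantor_step` and `cantor_level` are hypothetical, and exact rational arithmetic from the standard `fractions` module is assumed so that interval endpoints are not blurred by rounding.

```python
from fractions import Fraction


def cantor_step(intervals):
    """Delete the open middle third of every closed interval (a, b)."""
    result = []
    for a, b in intervals:
        third = (b - a) / 3
        result.append((a, a + third))   # left third is kept
        result.append((b - third, b))   # right third is kept
    return result


def cantor_level(k):
    """Return the intervals making up E_k, starting from E_0 = [0, 1]."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(k):
        intervals = cantor_step(intervals)
    return intervals


if __name__ == "__main__":
    for k in range(5):
        level = cantor_level(k)
        total_length = sum(b - a for a, b in level)
        # Number of intervals doubles; total length shrinks by 2/3 each step.
        print(k, len(level), total_length)
```

Running the loop shows the count of intervals doubling while their total length is multiplied by 2/3 at each stage, which is consistent with the statement that $E_k$ consists of $2^k$ intervals of length $3^{-k}$.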

Figure 0.1 Construction of the middle third Cantor set F, by repeated removal of the
middle third of intervals. Note that $F_L$ and $F_R$, the left and right parts of F, are copies
of F scaled by a factor $\frac{1}{3}$

At first glance it might appear that we have removed so much of the interval
[0, 1] during the construction of F, that nothing remains. In fact, F is an infinite
(and indeed uncountable) set, which contains infinitely many numbers in every
neighbourhood of each of its points. The middle third Cantor set F consists
precisely of those numbers in [0, 1] whose base-3 expansion does not contain
the digit 1, i.e. all numbers $a_1 3^{-1} + a_2 3^{-2} + a_3 3^{-3} + \cdots$ with $a_i = 0$ or 2 for
each i. To see this, note that to get $E_1$ from $E_0$ we remove those numbers with
$a_1 = 1$, to get $E_2$ from $E_1$ we remove those numbers with $a_2 = 1$, and so on.
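
The digit description suggests a simple membership test for rational numbers, sketched below under stated assumptions. The function name `in_level` is hypothetical. At each step the number is rescaled by 3: landing in the left third corresponds to choosing the digit 0, landing in the right third to the digit 2, and landing strictly between them means the digit 1 cannot be avoided. Endpoints such as 1/3 survive because they have an alternative expansion (0.0222... in base 3) that avoids the digit 1.

```python
from fractions import Fraction


def in_level(x, k):
    """Return True if x lies in E_k, i.e. x survives the first k deletions."""
    x = Fraction(x)
    if not 0 <= x <= 1:
        return False
    for _ in range(k):
        if x <= Fraction(1, 3):
            x = 3 * x          # digit 0: zoom in on the left third
        elif x >= Fraction(2, 3):
            x = 3 * x - 2      # digit 2: zoom in on the right third
        else:
            return False       # digit 1 is unavoidable: x was deleted
    return True


if __name__ == "__main__":
    for x in (Fraction(1, 4), Fraction(1, 3), Fraction(1, 2), Fraction(3, 10)):
        print(x, in_level(x, 40))
```

Since the test only checks finitely many stages, a True result shows membership of $E_k$ rather than of F itself; for a rational x the rescaled values eventually repeat, so a sufficiently large k settles the question.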

We list some of the features of the middle third Cantor set F; as we shall see, 
similar features are found in many fractals. 

(i) F is self-similar. It is clear that the part of F in the interval $[0, \frac{1}{3}]$ and the
part of F in $[\frac{2}{3}, 1]$ are both geometrically similar to F, scaled by a factor
$\frac{1}{3}$. Again, the parts of F in each of the four intervals of $E_2$ are similar to
F but scaled by a factor $\frac{1}{9}$, and so on. The Cantor set contains copies of
itself at many different scales.
(ii) The set F has a 'fine structure'; that is, it contains detail at arbitrarily
small scales. The more we enlarge the picture of the Cantor set, the more
gaps become apparent to the eye.
(iii) Although F has an intricate detailed structure, the actual definition of F
is very straightforward.
(iv) F is obtained by a recursive procedure. Our construction consisted of
repeatedly removing the middle thirds of intervals. Successive steps give
increasingly good approximations $E_k$ to the set F.
(v) The geometry of F is not easily described in classical terms: it is not the
locus of the points that satisfy some simple geometric condition, nor is it
the set of solutions of any simple equation.
(vi) It is awkward to describe the local geometry of F: near each of its points
are a large number of other points, separated by gaps of varying lengths.
(vii) Although F is in some ways quite a large set (it is uncountably infinite),
its size is not quantified by the usual measures such as length; by any
reasonable definition F has length zero.

Our second example, the von Koch curve, will also be familiar to many readers;
see figure 0.2. We let $E_0$ be a line segment of unit length. The set $E_1$ consists of
the four segments obtained by removing the middle third of $E_0$ and replacing it
by the other two sides of the equilateral triangle based on the removed segment.
We construct $E_2$ by applying the same procedure to each of the segments in $E_1$,
and so on. Thus $E_k$ comes from replacing the middle third of each straight line
segment of $E_{k-1}$ by the other two sides of an equilateral triangle. When k is
large, the curves $E_{k-1}$ and $E_k$ differ only in fine detail and as k tends to infinity,
the sequence of polygonal curves $E_k$ approaches a limiting curve F, called the
von Koch curve.

Figure 0.2 (a) Construction of the von Koch curve F. At each stage, the middle third of
each interval is replaced by the other two sides of an equilateral triangle. (b) Three von
Koch curves fitted together to form a snowflake curve

The von Koch curve has features in many ways similar to those listed for
the middle third Cantor set. It is made up of four 'quarters' each similar to the
whole, but scaled by a factor $\frac{1}{3}$. The fine structure is reflected in the irregularities
at all scales; nevertheless, this intricate structure stems from a basically simple
construction. Whilst it is reasonable to call F a curve, it is much too irregular
to have tangents in the classical sense. A simple calculation shows that $E_k$ is of
length $(\frac{4}{3})^k$; letting k tend to infinity implies that F has infinite length. On the
other hand, F occupies zero area in the plane, so neither length nor area provides
a very useful description of the size of F.
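
The construction and the length calculation can be checked numerically. The following is a minimal sketch, not the method used for the figures in the book: points of the plane are represented as Python complex numbers, and the names `koch_step`, `koch_curve` and `polygon_length` are hypothetical.

```python
import cmath
import math


def koch_step(points):
    """Replace the middle third of each segment by the other two sides of an
    equilateral triangle built on it."""
    rotate_60 = cmath.exp(1j * math.pi / 3)  # rotation by 60 degrees
    new_points = [points[0]]
    for p, q in zip(points, points[1:]):
        d = (q - p) / 3
        a = p + d               # end of the first third
        c = p + 2 * d           # start of the last third
        b = a + d * rotate_60   # apex of the equilateral triangle
        new_points.extend([a, b, c, q])
    return new_points


def koch_curve(k):
    """Vertices of the polygonal curve E_k, starting from the unit segment E_0."""
    points = [0 + 0j, 1 + 0j]
    for _ in range(k):
        points = koch_step(points)
    return points


def polygon_length(points):
    """Total length of the polygonal curve through the given vertices."""
    return sum(abs(q - p) for p, q in zip(points, points[1:]))


if __name__ == "__main__":
    for k in range(6):
        print(k, polygon_length(koch_curve(k)), (4 / 3) ** k)
```

Each step replaces one segment by four segments of one third the length, so the printed lengths agree with $(\frac{4}{3})^k$ up to rounding error.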

Many other sets may be constructed using such recursive procedures. For 
example, the Sierpiński triangle or gasket is obtained by repeatedly removing
(inverted) equilateral triangles from an initial equilateral triangle of unit side- 
length; see figure 0.3. (For many purposes, it is better to think of this procedure 
as repeatedly replacing an equilateral triangle by three triangles of half the height.) 
A plane analogue of the Cantor set, a 'Cantor dust', is illustrated in figure 0.4. At 
each stage each remaining square is divided into 16 smaller squares of which four 
are kept and the rest discarded. (Of course, other arrangements or numbers of 
squares could be used to get different sets.) It should be clear that such examples 
have properties similar to those mentioned in connection with the Cantor set and 
the von Koch curve. The example depicted in figure 0.5 is constructed using two 
different similarity ratios. 

There are many other types of construction, some of which will be discussed 
in detail later in the book, that also lead to sets with these sorts of properties. 

Figure 0.3 Construction of the Sierpiński triangle ($\dim_H F = \dim_B F = \log 3 / \log 2$)

Figure 0.4 Construction of a 'Cantor dust' ($\dim_H F = \dim_B F = 1$)

Figure 0.5 Construction of a self-similar fractal with two different similarity ratios

The highly intricate structure of the Julia set illustrated in figure 0.6 stems from
the single quadratic function $f(z) = z^2 + c$ for a suitable constant c. Although
the set is not strictly self-similar in the sense that the Cantor set and von Koch
curve are, it is 'quasi-self-similar' in that arbitrarily small portions of the set can
be magnified and then distorted smoothly to coincide with a large part of the set.
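
Readers who wish to experiment may find the following rough sketch useful. It is an assumption-laden illustration rather than the method used for figure 0.6: it marks the points whose orbits under $f(z) = z^2 + c$ stay bounded for a fixed number of iterations (the 'filled' set, whose boundary is the Julia set), the value of c is chosen arbitrarily since the text does not specify one, and the names `escape_time` and `ascii_picture` are hypothetical. The escape radius max(2, |c|) is used because an orbit that leaves this disc necessarily tends to infinity.

```python
def escape_time(z, c, max_iter=200):
    """Number of iterations of f(z) = z*z + c before |z| exceeds the escape
    radius, or None if the orbit stays inside for max_iter iterations."""
    radius = max(2.0, abs(c))
    for n in range(max_iter):
        if abs(z) > radius:
            return n
        z = z * z + c
    return None


def ascii_picture(c, width=60, height=24, extent=1.6):
    """Crude text rendering: '#' marks points whose orbit did not escape."""
    rows = []
    for j in range(height):
        y = extent - 2 * extent * j / (height - 1)
        row = ""
        for i in range(width):
            x = -extent + 2 * extent * i / (width - 1)
            row += "#" if escape_time(complex(x, y), c) is None else "."
        rows.append(row)
    return "\n".join(rows)


if __name__ == "__main__":
    # c = -1 is an arbitrary illustrative choice, not the constant of figure 0.6.
    print(ascii_picture(-1 + 0j))
```

A finite iteration count can only approximate the set: points near the boundary may need many more iterations before their orbits escape.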

Figure 0.7 shows the graph of the function
$f(t) = \sum_{k=0}^{\infty} (\frac{3}{2})^{-k/2} \sin((\frac{3}{2})^k t)$; the
infinite summation leads to the graph having a fine structure, rather than being a
smooth curve to which classical calculus is applicable.
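
A partial sum of such a series is easy to evaluate. The sketch below assumes the function is $f(t) = \sum_{k=0}^{\infty} (\frac{3}{2})^{-k/2} \sin((\frac{3}{2})^k t)$; the names `weierstrass_partial` and `tail_bound` are hypothetical. Because each term is at most $(\frac{3}{2})^{-k/2}$ in absolute value, the error from truncating the series is bounded by a geometric tail.

```python
import math

LAMBDA = 1.5  # the ratio 3/2 assumed in the formula above


def weierstrass_partial(t, n_terms=60):
    """Sum of the first n_terms terms (k = 0, ..., n_terms - 1) of the series."""
    return sum(
        LAMBDA ** (-k / 2) * math.sin(LAMBDA ** k * t) for k in range(n_terms)
    )


def tail_bound(n_terms):
    """Upper bound for the absolute value of the omitted terms k >= n_terms."""
    ratio = LAMBDA ** -0.5
    return ratio ** n_terms / (1 - ratio)


if __name__ == "__main__":
    for i in range(11):
        t = i / 10
        print(f"{t:.1f}  {weierstrass_partial(t):+.6f}")
    print("truncation error is at most", tail_bound(60))
```

Plotting the partial sums on a fine grid of t values gives pictures of the kind described, with more irregularity appearing as further terms are included.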

Some of these constructions may be 'randomized'. Figure 0.8 shows a 'random 
von Koch curve' — a coin was tossed at each step in the construction to determine 
on which side of the curve to place the new pair of line segments. This random 
curve certainly has a fine structure, but the strict self-similarity of the von Koch 
curve has been replaced by a 'statistical self-similarity'. 
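
The randomized construction can be sketched in the same way as the deterministic one. The code below is an illustration under stated assumptions: one coin toss is made for every segment that is replaced (the text does not say exactly how the tosses were organized), points are represented as complex numbers, and the names `random_koch_step` and `random_koch_curve` are hypothetical.

```python
import cmath
import math
import random


def random_koch_step(points, rng):
    """Replace the middle third of each segment by two sides of an equilateral
    triangle, placed on a side of the curve chosen by a coin toss."""
    new_points = [points[0]]
    for p, q in zip(points, points[1:]):
        d = (q - p) / 3
        side = 1 if rng.random() < 0.5 else -1  # the coin toss
        apex = p + d + d * cmath.exp(side * 1j * math.pi / 3)
        new_points.extend([p + d, apex, p + 2 * d, q])
    return new_points


def random_koch_curve(k, seed=None):
    """Vertices of the k-th stage of a random von Koch curve."""
    rng = random.Random(seed)
    points = [0 + 0j, 1 + 0j]
    for _ in range(k):
        points = random_koch_step(points, rng)
    return points


if __name__ == "__main__":
    curve = random_koch_curve(4, seed=0)
    length = sum(abs(q - p) for p, q in zip(curve, curve[1:]))
    print(len(curve), length)
```

The length of each stage is the same as for the deterministic curve, since only the side on which each triangle is placed changes; what is lost is the exact similarity between the parts and the whole.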

These are all examples of sets that are commonly referred to as fractals. (The 
word 'fractal' was coined by Mandelbrot in his fundamental essay from the Latin 
fractus, meaning broken, to describe objects that were too irregular to fit into a 
traditional geometrical setting.) Properties such as those listed for the Cantor set 
are characteristic of fractals, and it is sets with such properties that we will have 
in mind throughout the book. Certainly, any fractal worthy of the name will 
have a fine structure, i.e. detail at all scales. Many fractals have some degree of 
self-similarity — they are made up of parts that resemble the whole in some way. 
Sometimes, the resemblance may be weaker than strict geometrical similarity; 
for example, the similarity may be approximate or statistical. 

Methods of classical geometry and calculus are unsuited to studying fractals
and we need alternative techniques. The main tool of fractal geometry
is dimension in its many forms. We are familiar enough with the idea that a
(smooth) curve is a 1-dimensional object and a surface is 2-dimensional. It is
less clear that, for many purposes, the Cantor set should be regarded as having
dimension $\log 2 / \log 3 = 0.631\ldots$ and the von Koch curve as having
dimension $\log 4 / \log 3 = 1.262\ldots$. This latter number is, at least, consistent with the
von Koch curve being 'larger than 1-dimensional' (having infinite length) and
'smaller than 2-dimensional' (having zero area).

Figure 0.6 A Julia set

Figure 0.7 Graph of $f(t) = \sum_{k=0}^{\infty} (\frac{3}{2})^{-k/2} \sin((\frac{3}{2})^k t)$




Figure 0.8 A random version of the von Koch curve 

Figure 0.9 Division of certain sets into four parts. The parts are similar to the whole with
ratios: $\frac{1}{4}$ for line segment (a); $\frac{1}{2}$ for square (b); $\frac{1}{9}$ for middle third Cantor set (c); $\frac{1}{3}$ for
von Koch curve (d)



The following argument gives one (rather crude) interpretation of the meaning
of these 'dimensions', indicating how they reflect scaling properties and
self-similarity. As figure 0.9 indicates, a line segment is made up of four copies of
itself, scaled by a factor $\frac{1}{4}$. The segment has dimension $-\log 4 / \log \frac{1}{4} = 1$. A
square, however, is made up of four copies of itself scaled by a factor $\frac{1}{2}$ (i.e.
with half the side length) and has dimension $-\log 4 / \log \frac{1}{2} = 2$. In the same way,
the von Koch curve is made up of four copies of itself scaled by a factor $\frac{1}{3}$ and
has dimension $-\log 4 / \log \frac{1}{3} = \log 4 / \log 3$, and the Cantor set may be regarded
as comprising four copies of itself scaled by a factor $\frac{1}{9}$ and having dimension
$-\log 4 / \log \frac{1}{9} = \log 2 / \log 3$. In general, a set made up of m copies of itself scaled
by a factor r might be thought of as having dimension $-\log m / \log r$. The number
obtained in this way is usually referred to as the similarity dimension of the set.
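
The formula $-\log m / \log r$ is easily evaluated for the four examples. The snippet below is a small sketch; the function name `similarity_dimension` and the dictionary `EXAMPLES` are hypothetical, and the pairs (m, r) are those read off from the argument above.

```python
import math


def similarity_dimension(m, r):
    """Dimension -log m / log r of a set made of m copies scaled by a factor r."""
    return -math.log(m) / math.log(r)


# (number of copies m, scaling factor r) for each example
EXAMPLES = {
    "line segment": (4, 1 / 4),
    "square": (4, 1 / 2),
    "von Koch curve": (4, 1 / 3),
    "middle third Cantor set": (4, 1 / 9),
}

if __name__ == "__main__":
    for name, (m, r) in EXAMPLES.items():
        print(f"{name}: {similarity_dimension(m, r):.4f}")
```

Describing the Cantor set instead as two copies scaled by a factor $\frac{1}{3}$ gives the same value, $\log 2 / \log 3$, so the number does not depend on which level of the self-similar decomposition is used.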

Unfortunately, similarity dimension is meaningful only for a relatively small 
class of strictly self-similar sets. Nevertheless, there are other definitions of 
dimension that are much more widely applicable. For example, Hausdorff dimen- 
sion and the box-counting dimensions may be defined for any sets, and, in 
these four examples, may be shown to equal the similarity dimension. The early 
chapters of the book are concerned with the definition and properties of Hausdorff 
and other dimensions, along with methods for their calculation. Very roughly, a 
dimension provides a description of how much space a set fills. It is a measure of 
the prominence of the irregularities of a set when viewed at very small scales. A 
dimension contains much information about the geometrical properties of a set. 

A word of warning is appropriate at this point. It is possible to define the 
'dimension' of a set in many ways, some satisfactory and others less so. It 
is important to realize that different definitions may give different values of 
dimension for the same set, and may also have very different properties.
Inconsistent usage has sometimes led to considerable confusion. In particular, warning
lights flash in my mind (as in the minds of other mathematicians) whenever the 
term 'fractal dimension' is seen. Though some authors attach a precise meaning 
to this, I have known others interpret it inconsistently in a single piece of work. 
The reader should always be aware of the definition in use in any discussion. 

In his original essay, Mandelbrot defined a fractal to be a set with Haus- 
dorff dimension strictly greater than its topological dimension. (The topological 
dimension of a set is always an integer and is 0 if it is totally disconnected, 1 if
each point has arbitrarily small neighbourhoods with boundary of dimension 0, 
and so on.) This definition proved to be unsatisfactory in that it excluded a num- 
ber of sets that clearly ought to be regarded as fractals. Various other definitions 
have been proposed, but they all seem to have this same drawback. 

My personal feeling is that the definition of a 'fractal' should be regarded in 
the same way as a biologist regards the definition of 'life'. There is no hard and 
fast definition, but just a list of properties characteristic of a living thing, such 
as the ability to reproduce or to move or to exist to some extent independently 
of the environment. Most living things have most of the characteristics on the 
list, though there are living objects that are exceptions to each of them. In the 
same way, it seems best to regard a fractal as a set that has properties such 
as those listed below, rather than to look for a precise definition which will 
almost certainly exclude some interesting cases. From the mathematician's point 
of view, this approach is no bad thing. It is difficult to avoid developing properties 
of dimension other than in a way that applies to 'fractal' and 'non-fractal' sets 
alike. For 'non-fractals', however, such properties are of little interest — they are 
generally almost obvious and could be obtained more easily by other methods. 

When we refer to a set F as a fractal, therefore, we will typically have the 
following in mind. 

(i) F has a fine structure, i.e. detail on arbitrarily small scales.
(ii) F is too irregular to be described in traditional geometrical language, both
locally and globally.
(iii) Often F has some form of self-similarity, perhaps approximate or statistical.
(iv) Usually, the 'fractal dimension' of F (defined in some way) is greater
than its topological dimension.
(v) In most cases of interest F is defined in a very simple way, perhaps
recursively.

What can we say about the geometry of as diverse a class of objects as frac- 
tals? Classical geometry gives us a clue. In Part I of this book we study certain 
analogues of familiar geometrical properties in the fractal situation. The orthog- 
onal projection, or 'shadow' of a circle in space onto a plane is, in general, an 
ellipse. The fractal projection theorems tell us about the 'shadows' of a fractal. 
For many purposes, a tangent provides a good local approximation to a circle. 
Though fractals do tend not to have tangents in any sense, it is often possible
to say a surprising amount about their local form. Two circles in the plane
in 'general position' either intersect in two points or not at all (we regard the 
case of mutual tangents as 'exceptional'). Using dimension, we can make sim- 
ilar statements about the intersection of fractals. Moving a circle perpendicular 
to its plane sweeps out a cylinder, with properties that are related to those of 
the original circle. Similar, and indeed more general, constructions are possible 
with fractals. 

Although classical geometry is of considerable intrinsic interest, it is also called 
upon widely in other areas of mathematics. For example, circles or parabolae 
occur as the solution curves of certain differential equations, and a knowledge of 
the geometrical properties of such curves aids our understanding of the differential 
equations. In the same way, the general theory of fractal geometry can be applied 
to the many branches of mathematics in which fractals occur. Various examples 
of this are given in Part II of the book. 

Historically, interest in geometry has been stimulated by its applications to 
nature. The ellipse assumed importance as the shape of planetary orbits, as did 
the sphere as the shape of the earth. The geometry of the ellipse and sphere can 
be applied to these physical situations. Of course, orbits are not quite elliptical, 
and the earth is not actually spherical, but for many purposes, such as the pre- 
diction of planetary motion or the study of the earth's gravitational field, these 
approximations may be perfectly adequate. 

A similar situation pertains with fractals. A glance at the recent physics liter- 
ature shows the variety of natural objects that are described as fractals — cloud 
boundaries, topographical surfaces, coastlines, turbulence in fluids, and so on. 
None of these are actual fractals — their fractal features disappear if they are 
viewed at sufficiently small scales. Nevertheless, over certain ranges of scale 
they appear very much like fractals, and at such scales may usefully be regarded 
as such. The distinction between 'natural fractals' and the mathematical 'frac- 
tal sets' that might be used to describe them was emphasized in Mandelbrot's 
original essay, but this distinction seems to have become somewhat blurred. 
There are no true fractals in nature. (There are no true straight lines or cir- 
cles either!) 

If the mathematics of fractal geometry is to be really worthwhile, then it 
should be applicable to physical situations. Considerable progress is being made 
in this direction and some examples are given towards the end of this book. 
Although there are natural phenomena that have been explained in terms of fractal 
mathematics (Brownian motion is a good example), many applications tend to 
be descriptive rather than predictive. Much of the basic mathematics used in the 
study of fractals is not particularly new, though much recent mathematics has 
been specifically geared to fractals. For further progress to be made, development 
and application of appropriate mathematics remain a high priority. 

Notes and references

Unlike the rest of the book, which consists of fairly solid mathematics, this 
introduction contains some of the author's opinions and prejudices, which may 
well not be shared by other workers on fractals. Caveat emptor ! 

The foundational treatise on fractals, which may be appreciated at many levels, 
is the scientific, philosophical and pictorial essay of Mandelbrot (1982) (devel- 
oped from the original 1975 version), containing a great diversity of natural and 
mathematical examples. This essay has been the inspiration for much of the work 
that has been done on fractals. 

Many other books have been written on diverse aspects of fractals, and these 
are cited at the end of the appropriate chapters. Here we mention a selection with a 
broad coverage. Introductory treatments include Schroeder (1991), Moon (1992), 
Kaye (1994), Addison (1997) and Lesmoir-Gordon, Rood and Edney (2000). 
The volume by Peitgen, Jürgens and Saupe (1992) is profusely illustrated with
diagrams and examples, and the essays collated by Frame and Mandelbrot (2002) 
address the role of fractals in mathematics and science education. 

The books by Edgar (1990, 1998), Peitgen, Jürgens and Saupe (1992) and Le
Méhauté (1991) provide basic mathematical treatments. Falconer (1985a),
Mattila (1995), Federer (1996) and Morgan (2000) concentrate on geometric measure
theory, Rogers (1998) addresses the general theory of Hausdorff measures, and 
Wicks (1991) approaches the subject from the standpoint of non-standard anal- 
ysis. Books with a computational emphasis include Peitgen and Saupe (1988), 
Devaney and Keen (1989), Hoggar (1992) and Pickover (1998). The sequel to 
this book, Falconer (1997), contains more advanced mathematical techniques for 
studying fractals. 

Much of interest may be found in proceedings of conferences on fractal mathe- 
matics, for example in the volumes edited by Cherbit (1991), Evertsz, Peitgen and 
Voss (1995) and Novak (1998, 2000). The proceedings edited by Bandt, Graf and
Zähle (1995, 2000) concern fractals and probability, and those edited by Lévy
Véhel, Lutton and Tricot (1997) and by Dekking, Lévy Véhel, Lutton and Tricot
(1999) address engineering applications. Mandelbrot's 'Selecta' (1997, 1999, 2002) present a wide
range of papers with commentaries which provide a fascinating insight into the 
development and current state of fractal mathematics and science. Edgar (1993) 
brings together a collection of classic papers on fractal mathematics. 

Papers on fractals appear in many journals; in particular the journal Fractals 
covers a wide range of theory and applications. 
Chapter 1 Mathematical background



This chapter reviews some of the basic mathematical ideas and notation that will 
be used throughout the book. Sections 1.1 on set theory and 1.2 on functions are 
rather concise; readers unfamiliar with this type of material are advised to consult 
a more detailed text on mathematical analysis. Measures and mass distributions 
play an important part in the theory of fractals. A treatment adequate for our 
needs is given in Section 1.3. By asking the reader to take on trust the existence 
of certain measures, we can avoid many of the technical difficulties usually 
associated with measure theory. Some notes on probability theory are given in 
Section 1.4; an understanding of this is needed in Chapters 15 and 16. 



1.1 Basic set theory 

In this section we recall some basic notions from set theory and point set topology.
We generally work in n-dimensional Euclidean space, $\mathbb{R}^n$, where $\mathbb{R}^1 = \mathbb{R}$ is
just the set of real numbers or the 'real line', and $\mathbb{R}^2$ is the (Euclidean) plane.
Points in $\mathbb{R}^n$ will generally be denoted by lower case letters x, y, etc., and we will
occasionally use the coordinate form $x = (x_1, \ldots, x_n)$, $y = (y_1, \ldots, y_n)$.
Addition and scalar multiplication are defined in the usual manner, so that
$x + y = (x_1 + y_1, \ldots, x_n + y_n)$ and $\lambda x = (\lambda x_1, \ldots, \lambda x_n)$,
where $\lambda$ is a real scalar. We use the usual Euclidean distance or metric on $\mathbb{R}^n$.
So if x, y are points of $\mathbb{R}^n$, the distance between them is
$|x - y| = \left(\sum_{i=1}^{n} |x_i - y_i|^2\right)^{1/2}$. In particular, we have the
triangle inequality $|x + y| \le |x| + |y|$, the reverse triangle inequality
$|x - y| \ge \big| |x| - |y| \big|$, and the metric triangle inequality
$|x - y| \le |x - z| + |z - y|$, for all $x, y, z \in \mathbb{R}^n$.
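
The distance formula and the three inequalities above can be checked numerically on random points. The following is a minimal Python sketch added for illustration; the names `dist`, `norm` and `check_inequalities` are our own, and the tolerance `tol` is an assumption made only to absorb floating-point rounding.

```python
import math
import random

def dist(x, y):
    """Euclidean distance |x - y| between two points given as equal-length sequences."""
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))

def norm(x):
    """Euclidean length |x|, i.e. the distance from x to the origin."""
    return dist(x, [0.0] * len(x))

def check_inequalities(n=3, trials=1000, tol=1e-9, seed=0):
    """Spot-check the three triangle inequalities on random points of R^n."""
    rng = random.Random(seed)
    for _ in range(trials):
        x, y, z = ([rng.uniform(-10, 10) for _ in range(n)] for _ in range(3))
        s = [xi + yi for xi, yi in zip(x, y)]
        if norm(s) > norm(x) + norm(y) + tol:            # |x + y| <= |x| + |y|
            return False
        if dist(x, y) < abs(norm(x) - norm(y)) - tol:    # |x - y| >= | |x| - |y| |
            return False
        if dist(x, y) > dist(x, z) + dist(z, y) + tol:   # |x - y| <= |x - z| + |z - y|
            return False
    return True
```

A passing check is of course only evidence on the sampled points, not a proof of the inequalities.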

Sets, which will generally be subsets of $\mathbb{R}^n$, are denoted by capital letters E,
F, U, etc. In the usual way, $x \in E$ means that the point x belongs to the set E,
and $E \subset F$ means that E is a subset of the set F. We write {x : condition} for
the set of x for which 'condition' is true. Certain frequently occurring sets have a
special notation. The empty set, which contains no elements, is written as $\emptyset$. The
integers are denoted by $\mathbb{Z}$ and the rational numbers by $\mathbb{Q}$. We use a superscript
+ to denote the positive elements of a set; thus $\mathbb{R}^+$ are the positive real numbers,
and $\mathbb{Z}^+$ are the positive integers. Occasionally we refer to the complex numbers
$\mathbb{C}$, which for many purposes may be identified with the plane $\mathbb{R}^2$, with $x_1 + i x_2$
corresponding to the point $(x_1, x_2)$.

The closed ball of centre x and radius r is defined by $B(x, r) = \{y : |y - x| \le r\}$.
Similarly the open ball is $B^{\circ}(x, r) = \{y : |y - x| < r\}$. Thus the closed
ball contains its bounding sphere, but the open ball does not. Of course in $\mathbb{R}^2$ a
ball is a disc and in $\mathbb{R}^1$ a ball is just an interval. If a < b we write [a, b] for the
closed interval $\{x : a \le x \le b\}$ and (a, b) for the open interval $\{x : a < x < b\}$.
Similarly [a, b) denotes the half-open interval $\{x : a \le x < b\}$, etc.

The coordinate cube of side 2r and centre $x = (x_1, \ldots, x_n)$ is the set
$\{y = (y_1, \ldots, y_n) : |y_i - x_i| \le r \text{ for all } i = 1, \ldots, n\}$.
(A cube in $\mathbb{R}^2$ is just a square and in $\mathbb{R}^1$ is an interval.)

From time to time we refer to the $\delta$-neighbourhood or $\delta$-parallel body, $A_\delta$, of
a set A, that is the set of points within distance $\delta$ of A; thus
$A_\delta = \{x : |x - y| \le \delta \text{ for some } y \text{ in } A\}$; see figure 1.1.

We write $A \cup B$ for the union of the sets A and B, i.e. the set of points belonging
to either A or B, or both. Similarly, we write $A \cap B$ for their intersection,
the points in both A and B. More generally, $\bigcup_\alpha A_\alpha$ denotes the union of an
arbitrary collection of sets $\{A_\alpha\}$, i.e. those points in at least one of the sets $A_\alpha$,
and $\bigcap_\alpha A_\alpha$ denotes their intersection, consisting of the set of points common to
all of the $A_\alpha$. A collection of sets is disjoint if the intersection of any pair is the
empty set. The difference $A \setminus B$ of A and B consists of the points in A but not
B. The set $\mathbb{R}^n \setminus A$ is termed the complement of A.

The set of all ordered pairs $\{(a, b) : a \in A \text{ and } b \in B\}$ is called the (Cartesian)
product of A and B and is denoted by $A \times B$. If $A \subset \mathbb{R}^n$ and $B \subset \mathbb{R}^m$ then
$A \times B \subset \mathbb{R}^{n+m}$.

If A and B are subsets of $\mathbb{R}^n$ and $\lambda$ is a real number, we define the vector
sum of the sets as $A + B = \{x + y : x \in A \text{ and } y \in B\}$ and we define the scalar
multiple $\lambda A = \{\lambda x : x \in A\}$.
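
For finite sets the vector sum and scalar multiple can be computed directly from these definitions. The sketch below is an illustration only: it represents points as tuples of numbers and sets as Python `set` objects, and the function names `vector_sum` and `scalar_multiple` are our own choices.

```python
def vector_sum(A, B):
    """Vector sum A + B = {x + y : x in A and y in B} of two finite sets of points."""
    return {tuple(a + b for a, b in zip(x, y)) for x in A for y in B}

def scalar_multiple(lam, A):
    """Scalar multiple lam * A = {lam * x : x in A} of a finite set of points."""
    return {tuple(lam * a for a in x) for x in A}

# Two unit 'segments' reduced to their end points: their vector sum is the
# set of four corners of the unit square.
A = {(0, 0), (1, 0)}
B = {(0, 0), (0, 1)}
corners = vector_sum(A, B)
```

Note that A + B may contain fewer than (number of points of A) times (number of points of B) points, since different pairs can have the same sum.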

An infinite set A is countable if its elements can be listed in the form $x_1, x_2, \ldots$
with every element of A appearing at a specific place in the list; otherwise the
set is uncountable. The sets $\mathbb{Z}$ and $\mathbb{Q}$ are countable but $\mathbb{R}$ is uncountable. Note
that a countable union of countable sets is countable.

Figure 1.1 A set A and its $\delta$-neighbourhood $A_\delta$

If A is any non-empty set of real numbers then the supremum sup A is the
least number m such that $x \le m$ for every x in A, or is $\infty$ if no such number
exists. Similarly, the infimum inf A is the greatest number m such that $m \le x$ for
all x in A, or is $-\infty$. Intuitively the supremum and infimum are thought of as the
maximum and minimum of the set, though it is important to realize that sup A
and inf A need not be members of the set A itself. For example, sup(0, 1) = 1,
but $1 \notin (0, 1)$. We write $\sup_{x \in B}(\ )$ for the supremum of the quantity in brackets,
which may depend on x, as x ranges over the set B.

We define the diameter |A| of a (non-empty) subset of $\mathbb{R}^n$ as the greatest
distance apart of pairs of points in A. Thus $|A| = \sup\{|x - y| : x, y \in A\}$. In $\mathbb{R}^n$
a ball of radius r has diameter 2r, and a cube of side length $\delta$ has diameter $\delta\sqrt{n}$.
A set A is bounded if it has finite diameter, or, equivalently, if A is contained
in some (sufficiently large) ball.
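
For a finite set the supremum in the definition of the diameter is a maximum, which makes the cube example easy to check. The following Python sketch is illustrative only; `diameter` and `cube_vertices` are names introduced here, and the check relies on the assumption (true for a cube, which is convex) that the greatest distance is attained between two vertices.

```python
import itertools
import math

def diameter(points):
    """Diameter |A| = max{|x - y| : x, y in A} of a finite non-empty set of points."""
    pts = list(points)
    return max(math.dist(p, q) for p in pts for q in pts)

def cube_vertices(n, side):
    """The 2**n vertices of the coordinate cube [0, side]^n in R^n."""
    return list(itertools.product((0.0, side), repeat=n))

# The diameter of a cube of side delta in R^n should be delta * sqrt(n).
n, delta = 3, 2.0
computed = diameter(cube_vertices(n, delta))
expected = delta * math.sqrt(n)
```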

Convergence of sequences is defined in the usual way. A sequence $\{x_k\}$ in
$\mathbb{R}^n$ converges to a point x of $\mathbb{R}^n$ as $k \to \infty$ if, given $\varepsilon > 0$, there exists a
number K such that $|x_k - x| < \varepsilon$ whenever k > K, that is if $|x_k - x|$ converges
to 0. The point x is called the limit of the sequence, and we write $x_k \to x$ or
$\lim_{k \to \infty} x_k = x$.

The ideas of 'open' and 'closed' that have been mentioned in connection with
balls apply to much more general sets. Intuitively, a set is closed if it contains
its boundary and open if it contains none of its boundary points. More precisely,
a subset A of $\mathbb{R}^n$ is open if, for all points x in A there is some ball B(x, r),
centred at x and of positive radius, that is contained in A. A set is closed if,
whenever $\{x_k\}$ is a sequence of points of A converging to a point x of $\mathbb{R}^n$, then
x is in A; see figure 1.2. The empty set and $\mathbb{R}^n$ are regarded as both open
and closed.

It may be shown that a set is open if and only if its complement is closed. The 
union of any collection of open sets is open, as is the intersection of any finite 
number of open sets. The intersection of any collection of closed sets is closed, 
as is the union of any finite number of closed sets; see Exercise 1.6.

A set A is called a neighbourhood of a point x if there is some (small) ball 
B(x, r) centred at x and contained in A. 




Figure 1.2 (a) An open set: there is a ball contained in the set centred at each point of
the set. (b) A closed set: the limit of any convergent sequence of points from the set
lies in the set. (c) The boundary of the set in (a) or (b)
The intersection of all the closed sets containing a set A is called the closure
of A, written $\overline{A}$. The union of all the open sets contained in A is the interior
int(A) of A. The closure of A is thought of as the smallest closed set containing
A, and the interior as the largest open set contained in A. The boundary $\partial A$ of A
is given by $\partial A = \overline{A} \setminus \mathrm{int}(A)$, thus $x \in \partial A$ if and only if the ball B(x, r) intersects
both A and its complement for all r > 0.

A set B is a dense subset of A if $B \subset A \subset \overline{B}$, i.e. if there are points of B
arbitrarily close to each point of A.

A set A is compact if any collection of open sets which covers A (i.e. with 
union containing A) has a finite subcollection which also covers A. Technically, 
compactness is an extremely useful property that enables infinite sets of condi- 
tions to be reduced to finitely many. However, as far as most of this book is 
concerned, it is enough to take the definition of a compact subset of $\mathbb{R}^n$ as one
that is both closed and bounded. 

The intersection of any collection of compact sets is compact. It may be shown
that if $A_1 \supset A_2 \supset \cdots$ is a decreasing sequence of non-empty compact sets then the
intersection $\bigcap_{i=1}^{\infty} A_i$ is non-empty; see Exercise 1.7. Moreover, if $\bigcap_{i=1}^{\infty} A_i$ is contained
in V for some open set V, then the finite intersection $\bigcap_{i=1}^{k} A_i$ is contained in V
for some k.

A subset A of $\mathbb{R}^n$ is connected if there do not exist open sets U and V such that
$U \cup V$ contains A with $A \cap U$ and $A \cap V$ disjoint and non-empty. Intuitively,
we think of a set A as connected if it consists of just one 'piece'. The largest
connected subset of A containing a point x is called the connected component
of x. The set A is totally disconnected if the connected component of each point
consists of just that point. This will certainly be so if for every pair of distinct points x
and y in A we can find disjoint open sets U and V such that $x \in U$, $y \in V$ and
$A \subset U \cup V$.

There is one further class of set that must be mentioned though its precise 
definition is indirect and should not concern the reader unduly. The class of Borel 
sets is the smallest collection of subsets of $\mathbb{R}^n$ with the following properties:

(a) every open set and every closed set is a Borel set; 

(b) the union of every finite or countable collection of Borel sets is a Borel 
set, and the intersection of every finite or countable collection of Borel 
sets is a Borel set. 

Throughout this book, virtually all of the subsets of $\mathbb{R}^n$ that will be of any
interest to us will be Borel sets. Any set that can be constructed using a sequence 
of countable unions or intersections starting with the open sets or closed sets will 
certainly be Borel. The reader will not go far wrong in work of the sort described 
in this book by assuming that all the sets encountered are Borel sets. 

1.2 Functions and limits 

Let X and Y be any sets. A mapping, function or transformation f from X to Y 
is a rule or formula that associates a point f(x) of Y with each point x of X. 
We write $f : X \to Y$ to denote this situation; X is called the domain of f and
Y is called the codomain. If A is any subset of X we write f(A) for the image
of A, given by $\{f(x) : x \in A\}$. If B is a subset of Y, we write $f^{-1}(B)$ for the
inverse image or pre-image of B, i.e. the set $\{x \in X : f(x) \in B\}$; note that in
this context the inverse image of a single point can contain many points.

A function $f : X \to Y$ is called an injection or a one-to-one function if
$f(x) \ne f(y)$ whenever $x \ne y$, i.e. different elements of X are mapped to different
elements of Y. The function is called a surjection or an onto function
if, for every y in Y, there is an element x in X with f(x) = y, i.e. every element
of Y is the image of some point in X. A function that is both an injection
and a surjection is called a bijection or one-to-one correspondence between X
and Y. If $f : X \to Y$ is a bijection then we may define the inverse function
$f^{-1} : Y \to X$ by taking $f^{-1}(y)$ as the unique element of X such that f(x) = y.
In this situation, $f^{-1}(f(x)) = x$ for x in X and $f(f^{-1}(y)) = y$ for y in Y.

The composition of the functions $f : X \to Y$ and $g : Y \to Z$ is the function
$g \circ f : X \to Z$ given by $(g \circ f)(x) = g(f(x))$. This definition extends to the
composition of any finite number of functions in the obvious way.

Certain functions from $\mathbb{R}^n$ to $\mathbb{R}^n$ have a particular geometric significance; often
in this context they are referred to as transformations and are denoted by capital
letters. Their effects are shown in figure 1.3. The transformation $S : \mathbb{R}^n \to \mathbb{R}^n$ is
called a congruence or isometry if it preserves distances, i.e. if $|S(x) - S(y)| = |x - y|$
for x, y in $\mathbb{R}^n$. Congruences also preserve angles, and transform sets
into geometrically congruent ones. Special cases include translations, which are
of the form S(x) = x + a and have the effect of shifting points parallel to the
vector a, rotations which have a centre a such that $|S(x) - a| = |x - a|$ for all
x (for convenience we also regard the identity transformation given by I(x) = x
as a rotation) and reflections which map points to their mirror images in some
(n - 1)-dimensional plane. A congruence that may be achieved by a combination
of a rotation and a translation, i.e. does not involve reflection, is called a rigid
motion or direct congruence. A transformation $S : \mathbb{R}^n \to \mathbb{R}^n$ is a similarity of
ratio or scale c > 0 if $|S(x) - S(y)| = c|x - y|$ for all x, y in $\mathbb{R}^n$. A similarity
transforms sets into geometrically similar ones with all lengths multiplied by the
factor c.

Figure 1.3 The effect of various transformations on a set A (panels: direct congruence
or rigid motion; (indirect) congruence; similarities; affinities)

A transformation $T : \mathbb{R}^n \to \mathbb{R}^n$ is linear if T(x + y) = T(x) + T(y) and
$T(\lambda x) = \lambda T(x)$ for all $x, y \in \mathbb{R}^n$ and $\lambda \in \mathbb{R}$; linear transformations may be
represented by matrices in the usual way. Such a linear transformation is non-singular
if T(x) = 0 if and only if x = 0. If $S : \mathbb{R}^n \to \mathbb{R}^n$ is of the form
S(x) = T(x) + a, where T is a non-singular linear transformation and a is a
point in $\mathbb{R}^n$, then S is called an affine transformation or an affinity. An affinity
may be thought of as a shearing transformation; its contracting or expanding effect
need not be the same in every direction. However, if T is orthonormal, then S
is a congruence, and if T is a scalar multiple of an orthonormal transformation
then S is a similarity.

It is worth pointing out that such classes of transformation form groups under
composition of mappings. For example, the composition of two translations is a
translation, the identity transformation is trivially a translation, and the inverse of
a translation is a translation. Finally, the associative law $S \circ (T \circ U) = (S \circ T) \circ U$
holds for all translations S, T, U. Similar group properties hold for the congruences,
the rigid motions, the similarities and the affinities.

A function $f : X \to Y$ is called a Hölder function of exponent $\alpha$ if

$$|f(x) - f(y)| \le c|x - y|^{\alpha} \qquad (x, y \in X)$$

for some constant $c \ge 0$. The function f is called a Lipschitz function if $\alpha$ may
be taken to be equal to 1, that is if

$$|f(x) - f(y)| \le c|x - y| \qquad (x, y \in X)$$

and a bi-Lipschitz function if

$$c_1|x - y| \le |f(x) - f(y)| \le c_2|x - y| \qquad (x, y \in X)$$

for $0 < c_1 \le c_2 < \infty$, in which case both f and $f^{-1} : f(X) \to X$ are Lipschitz
functions.
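
A standard example separating the two notions is the square root function on [0, 1], which satisfies the Hölder condition of exponent 1/2 with c = 1 but is not Lipschitz, since the difference quotients blow up near 0. The Python sketch below illustrates this numerically; it is an addition for illustration, the names `holder_ratio`, `max_ratio` and `GRID` are our own, and a finite grid can only suggest, not prove, the behaviour of the supremum.

```python
import math

def holder_ratio(f, x, y, alpha):
    """The quotient |f(x) - f(y)| / |x - y|**alpha for x != y."""
    return abs(f(x) - f(y)) / abs(x - y) ** alpha

def max_ratio(f, points, alpha):
    """Largest Holder quotient of exponent alpha over all pairs of distinct grid points."""
    return max(holder_ratio(f, x, y, alpha)
               for x in points for y in points if x != y)

# An evenly spaced grid on [0, 1]; the spacing 1/200 is an arbitrary choice.
GRID = [i / 200 for i in range(201)]

half_exponent = max_ratio(math.sqrt, GRID, 0.5)   # stays bounded by c = 1
lipschitz_try = max_ratio(math.sqrt, GRID, 1.0)   # grows as the grid is refined
```

Refining the grid leaves the first quantity at about 1 while the second grows without bound, which is the numerical signature of a function that is Hölder of exponent 1/2 but not Lipschitz.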

We next remind readers of the basic ideas of limits and continuity of functions.
Let X and Y be subsets of $\mathbb{R}^n$ and $\mathbb{R}^m$ respectively, let $f : X \to Y$ be a function,
and let a be a point of $\overline{X}$. We say that f(x) has limit y (or tends to y, or converges
to y) as x tends to a, if, given $\varepsilon > 0$, there exists $\delta > 0$ such that $|f(x) - y| < \varepsilon$
for all $x \in X$ with $|x - a| < \delta$. We denote this by writing $f(x) \to y$ as $x \to a$
or by $\lim_{x \to a} f(x) = y$. For a function $f : X \to \mathbb{R}$ we say that f(x) tends to
infinity (written $f(x) \to \infty$) as $x \to a$ if, given M, there exists $\delta > 0$ such that
f(x) > M whenever $|x - a| < \delta$. The definition of $f(x) \to -\infty$ is similar.

Suppose, now, that $f : \mathbb{R}^+ \to \mathbb{R}$. We shall frequently be interested in the
values of such functions for small positive values of x. Note that if f(x) is
increasing as x decreases, then $\lim_{x \to 0} f(x)$ exists either as a finite limit or as
$\infty$, and if f(x) is decreasing as x decreases then $\lim_{x \to 0} f(x)$ exists and is finite
or $-\infty$. Of course, f(x) can fluctuate wildly for small x and $\lim_{x \to 0} f(x)$ need
not exist at all. We use lower and upper limits to describe such fluctuations. We
define the lower limit as

$$\underline{\lim}_{x \to 0} f(x) \equiv \lim_{r \to 0} \big(\inf\{f(x) : 0 < x < r\}\big).$$

Since $\inf\{f(x) : 0 < x < r\}$ is either $-\infty$ for all positive r or else increases as
r decreases, $\underline{\lim}_{x \to 0} f(x)$ always exists. Similarly, the upper limit is defined as

$$\overline{\lim}_{x \to 0} f(x) \equiv \lim_{r \to 0} \big(\sup\{f(x) : 0 < x < r\}\big).$$

The lower and upper limits exist (as real numbers or $-\infty$ or $\infty$) for every function
f, and are indicative of the variation in values of f for x close to 0; see figure 1.4.
Clearly, $\underline{\lim}_{x \to 0} f(x) \le \overline{\lim}_{x \to 0} f(x)$; if the lower and upper limits are equal, then
$\lim_{x \to 0} f(x)$ exists and equals this common value. Note that if $f(x) \le g(x)$ for
x > 0 then $\underline{\lim}_{x \to 0} f(x) \le \underline{\lim}_{x \to 0} g(x)$ and $\overline{\lim}_{x \to 0} f(x) \le \overline{\lim}_{x \to 0} g(x)$. In the
same way, it is possible to define lower and upper limits as $x \to a$ for functions
$f : X \to \mathbb{R}$ where X is a subset of $\mathbb{R}^n$ with a in $\overline{X}$.
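
The lower and upper limits can be estimated numerically by sampling the infimum and supremum of f over (0, r) for decreasing r. The sketch below is an illustration under stated assumptions: the two test functions, sin(1/x) and x sin(1/x), and the sampling scheme in `sample_points` are our own choices, and a finite sample can only bound the true infimum from above and the true supremum from below.

```python
import math

def sample_points(r, n=2000, h=0.01):
    """Return n points of (0, r), chosen so that 1/x runs over an evenly spaced grid."""
    return [1.0 / (1.0 / r + h * j) for j in range(1, n + 1)]

def inf_sup_on(f, r):
    """Sampled estimates of inf{f(x) : 0 < x < r} and sup{f(x) : 0 < x < r}."""
    values = [f(x) for x in sample_points(r)]
    return min(values), max(values)

def oscillating(x):
    """sin(1/x): lower limit -1 and upper limit 1 as x -> 0, so no limit exists."""
    return math.sin(1.0 / x)

def damped(x):
    """x sin(1/x): lower and upper limits both 0, so the limit exists and is 0."""
    return x * math.sin(1.0 / x)

for r in (0.1, 0.01, 0.001):
    print(r, inf_sup_on(oscillating, r), inf_sup_on(damped, r))
```

For the first function the sampled infimum and supremum stay near -1 and 1 however small r is taken, whereas for the second both are squeezed between -r and r.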

We often need to compare two functions $f, g : \mathbb{R}^+ \to \mathbb{R}$ for small values.
We write $f(x) \sim g(x)$ to mean that $f(x)/g(x) \to 1$ as $x \to 0$. We will often
have that $f(x) \sim x^s$; in other words that f obeys an approximate power law of
exponent s when x is small. We use the notation $f(x) \simeq g(x)$ more loosely, to
mean that f(x) and g(x) are approximately equal in some sense, to be specified
in the particular circumstances.

Figure 1.4 The upper and lower limits of a function

Recall that a function $f : X \to Y$ is continuous at a point a of X if $f(x) \to f(a)$
as $x \to a$, and is continuous on X if it is continuous at all points of X. In
particular, Lipschitz and Hölder mappings are continuous. If $f : X \to Y$ is a
continuous bijection with continuous inverse $f^{-1} : Y \to X$ then f is called a
homeomorphism, and X and Y are termed homeomorphic sets. Congruences,
similarities and affine transformations on $\mathbb{R}^n$ are examples of homeomorphisms.

The function $f : \mathbb{R} \to \mathbb{R}$ is differentiable at x with the number $f'(x)$ as derivative if

$$\lim_{h \to 0} \frac{f(x + h) - f(x)}{h} = f'(x).$$

In particular, the mean value theorem applies: given a < b and f differentiable
on [a, b] there exists c with a < c < b such that

$$\frac{f(b) - f(a)}{b - a} = f'(c)$$

(intuitively, any chord of the graph of f is parallel to the slope of f at some
intermediate point). A function f is continuously differentiable if $f'(x)$ is continuous
in x.

More generally, if $f : \mathbb{R}^n \to \mathbb{R}^n$, we say that f is differentiable at x with
derivative the linear mapping $f'(x) : \mathbb{R}^n \to \mathbb{R}^n$ if

$$\lim_{|h| \to 0} \frac{|f(x + h) - f(x) - f'(x)h|}{|h|} = 0.$$

Occasionally, we shall be interested in the convergence of a sequence of
functions $f_k : X \to Y$ where X and Y are subsets of Euclidean spaces. We
say that functions $f_k$ converge pointwise to a function $f : X \to Y$ if $f_k(x) \to f(x)$
as $k \to \infty$ for each x in X. We say that the convergence is uniform if
$\sup_{x \in X} |f_k(x) - f(x)| \to 0$ as $k \to \infty$. Uniform convergence is a rather stronger
property than pointwise convergence; the rate at which the limit is approached
is uniform across X. If the functions $f_k$ are continuous and converge uniformly
to f, then f is continuous.

Finally, we remark that logarithms will always be to base e. Recall that, for
a, b > 0, we have that $\log ab = \log a + \log b$, and that $\log a^c = c \log a$ for real
numbers c. The identity $a^c = b^{c \log a / \log b}$ will often be used. The logarithm is the
inverse of the exponential function, so that $e^{\log x} = x$, for x > 0, and $\log e^y = y$
for $y \in \mathbb{R}$.
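
The change-of-base identity for powers is easily confirmed numerically. The short Python sketch below is an illustration only; the function name `power_via_base` is our own, and the identity requires a, b > 0 with b different from 1 so that log b is non-zero.

```python
import math

def power_via_base(a, c, b):
    """Compute a**c as b**(c log a / log b); requires a, b > 0 and b != 1."""
    return b ** (c * math.log(a) / math.log(b))

# Compare with direct exponentiation for some arbitrary sample values.
direct = 3.0 ** 2.5
via_base_two = power_via_base(3.0, 2.5, 2.0)
```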
1.3 Measures and mass distributions

Anyone studying the mathematics of fractals will not get far before encountering 
measures in some form or other. Many people are put off by the seemingly 
technical nature of measure theory — often unnecessarily so, since for most fractal 
applications only a few basic ideas are needed. Moreover, these ideas are often 
already familiar in the guise of the mass or charge distributions encountered in 
basic physics. 

We need only be concerned with measures on subsets of $\mathbb{R}^n$. Basically a
measure is just a way of ascribing a numerical 'size' to sets, such that if a set
is decomposed into a finite or countable number of pieces in a reasonable way,
then the size of the whole is the sum of the sizes of the pieces.

We call $\mu$ a measure on $\mathbb{R}^n$ if $\mu$ assigns a non-negative number, possibly $\infty$,
to each subset of $\mathbb{R}^n$ such that:

(a) $\mu(\emptyset) = 0$; (1.1)

(b) $\mu(A) \le \mu(B)$ if $A \subset B$; (1.2)

(c) if $A_1, A_2, \ldots$ is a countable (or finite) sequence of sets then

$$\mu\Big(\bigcup_{i=1}^{\infty} A_i\Big) \le \sum_{i=1}^{\infty} \mu(A_i) \tag{1.3}$$

with equality in (1.3), i.e.

$$\mu\Big(\bigcup_{i=1}^{\infty} A_i\Big) = \sum_{i=1}^{\infty} \mu(A_i), \tag{1.4}$$

if the $A_i$ are disjoint Borel sets.

We call $\mu(A)$ the measure of the set A, and think of $\mu(A)$ as the size of A
measured in some way. Condition (a) says that the empty set has zero measure,
condition (b) says 'the larger the set, the larger the measure' and (c) says that if
a set is a union of a countable number of pieces (which may overlap) then the
sum of the measure of the pieces is at least equal to the measure of the whole.
If a set is decomposed into a countable number of disjoint Borel sets then the
total measure of the pieces equals the measure of the whole.

Technical note. For the measures that we shall encounter, (1.4) generally holds
for a much wider class of sets than just the Borel sets, in particular for all
images of Borel sets under continuous functions. However, for reasons that need
not concern us here, we cannot in general require that (1.4) holds for every
countable collection of disjoint sets $A_i$. The reader who is familiar with measure
theory will realize that our definition of a measure on $\mathbb{R}^n$ is the definition of
what would normally be termed 'an outer measure on $\mathbb{R}^n$ for which the Borel
sets are measurable'. However, to save frequent referral to 'measurable sets' it
is convenient to have $\mu(A)$ defined for every set A, and, since we are usually
interested in measures of Borel sets, it is enough to have (1.4) holding for Borel
sets rather than for a larger class. If $\mu$ is defined and satisfies (1.1)-(1.4) for
the Borel sets, the definition of $\mu$ may be extended to an outer measure on all
sets in such a way that (1.1)-(1.3) hold, so our definition is consistent with the
usual one.

If $A \supset B$ then A may be expressed as a disjoint union $A = B \cup (A \setminus B)$, so it
is immediate from (1.4) that, if A and B are Borel sets,

$$\mu(A \setminus B) = \mu(A) - \mu(B). \tag{1.5}$$

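Similarly, if $A_1 \subset A_2 \subset \cdots$ is an increasing sequence of Borel sets then

$$\lim_{i \to \infty} \mu(A_i) = \mu\Big(\bigcup_{i=1}^{\infty} A_i\Big). \tag{1.6}$$

To see this, note that $\bigcup_{i=1}^{\infty} A_i = A_1 \cup (A_2 \setminus A_1) \cup (A_3 \setminus A_2) \cup \ldots$, with this union
disjoint, so that

$$\begin{aligned}
\mu\Big(\bigcup_{i=1}^{\infty} A_i\Big) &= \mu(A_1) + \sum_{i=1}^{\infty} \big(\mu(A_{i+1}) - \mu(A_i)\big) \\
&= \mu(A_1) + \lim_{k \to \infty} \sum_{i=1}^{k} \big(\mu(A_{i+1}) - \mu(A_i)\big) \\
&= \lim_{k \to \infty} \mu(A_k).
\end{aligned}$$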

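More generally, it can be shown that if, for $\delta > 0$, $A_\delta$ are Borel sets that are
increasing as $\delta$ decreases, i.e. $A_{\delta'} \subset A_\delta$ for $0 < \delta < \delta'$, then

$$\lim_{\delta \to 0} \mu(A_\delta) = \mu\Big(\bigcup_{\delta > 0} A_\delta\Big). \tag{1.7}$$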

We think of the support of a measure as the set on which the measure is
concentrated. Formally, the support of $\mu$, written spt $\mu$, is the smallest closed set
X such that $\mu(\mathbb{R}^n \setminus X) = 0$. The support of a measure is always closed and x is
in the support if and only if $\mu(B(x, r)) > 0$ for all positive radii r. We say that
$\mu$ is a measure on a set A if A contains the support of $\mu$.

A measure on a bounded subset of $\mathbb{R}^n$ for which $0 < \mu(\mathbb{R}^n) < \infty$ will be
called a mass distribution, and we think of $\mu(A)$ as the mass of the set A. We
often think of this intuitively: we take a finite mass and spread it in some way 
across a set X to get a mass distribution on X; the conditions for a measure will 
then be satisfied. 
We give some examples of measures and mass distributions. In general, we
omit the proofs that measures with the stated properties exist. Much of technical 
measure theory concerns the existence of such measures, but, as far as applica- 
tions go, their existence is intuitively reasonable, and can be taken on trust. 



Example 1.1. The counting measure 

For each subset A of $\mathbb{R}^n$ let $\mu(A)$ be the number of points in A if A is finite,
and $\infty$ otherwise. Then $\mu$ is a measure on $\mathbb{R}^n$.



Example 1.2. Point mass 

Let a be a point in $\mathbb{R}^n$ and define $\mu(A)$ to be 1 if A contains a, and 0 otherwise.
Then $\mu$ is a mass distribution, thought of as a point mass concentrated at a.
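
Restricted to finite sets, the counting measure and the point mass are simple enough to model directly, and conditions (1.1)-(1.4) can be spot-checked on examples. The Python sketch below is an illustration only: the names `counting_measure`, `point_mass` and `satisfies_measure_axioms` are our own, sets are finite Python `set` objects (so the value $\infty$ is never needed), and a passing check is not a proof.

```python
def counting_measure(A):
    """Number of points of a finite set A (the infinite case is not modelled here)."""
    return len(A)

def point_mass(a):
    """Return the measure mu with mu(A) = 1 if A contains a, and 0 otherwise."""
    def mu(A):
        return 1 if a in A else 0
    return mu

def satisfies_measure_axioms(mu, pieces):
    """Spot-check (1.1)-(1.4) for mu on a finite list of finite sets."""
    if mu(set()) != 0:                                   # (1.1)
        return False
    union = set().union(*pieces)
    if any(mu(A) > mu(union) for A in pieces):           # (1.2): each piece lies in the union
        return False
    total = sum(mu(A) for A in pieces)
    if mu(union) > total:                                # (1.3): countable subadditivity
        return False
    disjoint = sum(len(A) for A in pieces) == len(union)
    if disjoint and mu(union) != total:                  # (1.4): additivity on disjoint pieces
        return False
    return True
```

For instance, with the overlapping pieces {1, 2} and {2, 3} the counting measure gives 3 for the union but 4 for the sum of the pieces, so the inequality in (1.3) is strict.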



Example 1.3. Lebesgue measure on R 

Lebesgue measure $\mathcal{L}^1$ extends the idea of 'length' to a large collection of subsets
of $\mathbb{R}$, that includes the Borel sets. For open and closed intervals, we take
$\mathcal{L}^1(a, b) = \mathcal{L}^1[a, b] = b - a$. If $A = \bigcup_i [a_i, b_i]$ is a finite or countable union of
disjoint intervals, we let $\mathcal{L}^1(A) = \sum (b_i - a_i)$ be the length of A thought of as the
sum of the length of the intervals. This leads us to the definition of the Lebesgue
measure $\mathcal{L}^1(A)$ of an arbitrary set A. We define

$$\mathcal{L}^1(A) = \inf\Big\{\sum_{i=1}^{\infty} (b_i - a_i) : A \subset \bigcup_{i=1}^{\infty} [a_i, b_i]\Big\},$$

that is, we look at all coverings of A by countable collections of intervals, and
take the smallest total interval length possible. It is not hard to see that (1.1)-(1.3)
hold; it is rather harder to show that (1.4) holds for disjoint Borel sets $A_i$, and
we avoid this question here. (In fact, (1.4) holds for a much larger class of sets
than the Borel sets, 'the Lebesgue measurable sets', but not for all subsets of $\mathbb{R}$.)
Lebesgue measure on $\mathbb{R}$ is generally thought of as 'length', and we often write
length(A) for $\mathcal{L}^1(A)$ when we wish to emphasize this intuitive meaning.

Example 1.4. Lebesgue measure on $\mathbb{R}^n$

If $A = \{(x_1, \ldots, x_n) \in \mathbb{R}^n : a_i \le x_i \le b_i\}$ is a 'coordinate parallelepiped' in $\mathbb{R}^n$,
the n-dimensional volume of A is given by

$$\mathrm{vol}^n(A) = (b_1 - a_1)(b_2 - a_2) \cdots (b_n - a_n).$$

(Of course, $\mathrm{vol}^1$ is length, as in Example 1.3, $\mathrm{vol}^2$ is area and $\mathrm{vol}^3$ is the usual
3-dimensional volume.) Then n-dimensional Lebesgue measure $\mathcal{L}^n$ may be thought
of as the extension of n-dimensional volume to a large class of sets. Just as in
Example 1.3, we obtain a measure on $\mathbb{R}^n$ by defining

$$\mathcal{L}^n(A) = \inf\Big\{\sum_{i=1}^{\infty} \mathrm{vol}^n(A_i) : A \subset \bigcup_{i=1}^{\infty} A_i\Big\}$$

where the infimum is taken over all coverings of A by coordinate parallelepipeds
$A_i$. We get that $\mathcal{L}^n(A) = \mathrm{vol}^n(A)$ if A is a coordinate parallelepiped or, indeed,
any set for which the volume can be determined by the usual rules of mensuration.
Again, to aid intuition, we sometimes write area(A) in place of $\mathcal{L}^2(A)$, vol(A)
for $\mathcal{L}^3(A)$ and $\mathrm{vol}^n(A)$ for $\mathcal{L}^n(A)$.

Sometimes, we need to define 'k-dimensional' volume on a k-dimensional
plane X in $\mathbb{R}^n$; this may be done by identifying X with $\mathbb{R}^k$ and using $\mathcal{L}^k$ on
subsets of X in the obvious way.

Example 1.5. Uniform mass distribution on a line segment 

Let L be a line segment of unit length in the plane. Define $\mu(A) = \mathcal{L}^1(L \cap A)$,
i.e. the 'length' of intersection of A with L. Then $\mu$ is a mass distribution with
support L, since $\mu(A) = 0$ if $A \cap L = \emptyset$. We may think of $\mu$ as unit mass spread
evenly along the line segment L.

Example 1.6. Restriction of a measure 

Let $\mu$ be a measure on $\mathbb{R}^n$ and E a Borel subset of $\mathbb{R}^n$. We may define a measure
$\nu$ on $\mathbb{R}^n$, called the restriction of $\mu$ to E, by $\nu(A) = \mu(E \cap A)$ for every set A.
Then $\nu$ is a measure on $\mathbb{R}^n$ with support contained in $\overline{E}$.

As far as this book is concerned, the most important measures we shall meet are 
the s-dimensional Hausdorff measures $\mathcal{H}^s$ on subsets of $\mathbb{R}^n$, where $0 \le s \le n$.
These measures, which are introduced in Section 2.1, are a generalization of 
Lebesgue measures to dimensions that are not necessarily integral. 

The following method is often used to construct a mass distribution on a subset
of $\mathbb{R}^n$. It involves repeated subdivision of a mass between parts of a bounded
Borel set E. Let $\mathcal{E}_0$ consist of the single set E. For k = 1, 2, ... we let $\mathcal{E}_k$ be a
collection of disjoint Borel subsets of E such that each set U in $\mathcal{E}_k$ is contained
in one of the sets of $\mathcal{E}_{k-1}$ and contains a finite number of the sets in $\mathcal{E}_{k+1}$. We
assume that the maximum diameter of the sets in $\mathcal{E}_k$ tends to 0 as $k \to \infty$. We
define a mass distribution on E by repeated subdivision; see figure 1.5. We let
$\mu(E)$ satisfy $0 < \mu(E) < \infty$, and we split this mass between the sets $U_1, \ldots, U_m$
in $\mathcal{E}_1$ by defining $\mu(U_i)$ in such a way that $\sum_{i=1}^{m} \mu(U_i) = \mu(E)$. Similarly, we
assign masses to the sets of $\mathcal{E}_2$ so that if $U_1, \ldots, U_m$ are the sets of $\mathcal{E}_2$ contained
in a set U of $\mathcal{E}_1$, then $\sum_{i=1}^{m} \mu(U_i) = \mu(U)$. In general, we assign masses so that

$$\sum_i \mu(U_i) = \mu(U) \tag{1.8}$$

for each set U of $\mathcal{E}_k$, where the $\{U_i\}$ are the disjoint sets in $\mathcal{E}_{k+1}$ contained in U.
For each k, we let $E_k$ be the union of the sets in $\mathcal{E}_k$, and we define $\mu(A) = 0$ for
all A with $A \cap E_k = \emptyset$.

Figure 1.5 Steps in the construction of a mass distribution $\mu$ by repeated subdivision.
The mass on the sets of $\mathcal{E}_k$ is divided between the sets of $\mathcal{E}_{k+1}$; so, for example,
$\mu(U) = \mu(U_1) + \mu(U_2)$

Let $\mathcal{E}$ denote the collection of sets that belong to $\mathcal{E}_k$ for some k together with
the subsets of $\mathbb{R}^n \setminus E_k$. The above procedure defines the mass $\mu(A)$ of every set A
in $\mathcal{E}$, and it should seem reasonable that, by building up sets from the sets in $\mathcal{E}$, it
specifies enough about the distribution of the mass $\mu$ across E to determine $\mu(A)$
for any (Borel) set A. This is indeed the case, as the following proposition states.



Proposition 1.7 

Let $\mu$ be defined on a collection of sets $\mathcal{E}$ as above. Then the definition of $\mu$
may be extended to all subsets of $\mathbb{R}^n$ so that $\mu$ becomes a measure. The value of
$\mu(A)$ is uniquely determined if A is a Borel set. The support of $\mu$ is contained in
$\bigcap_{k=1}^{\infty} \overline{E_k}$.

Note on Proof. If A is any subset of $\mathbb{R}^n$, let

$$\mu(A) = \inf\Big\{\sum_i \mu(U_i) : A \subset \bigcup_i U_i \text{ and } U_i \in \mathcal{E}\Big\}. \tag{1.9}$$

(Thus we take the smallest value we can of $\sum_{i=1}^{\infty} \mu(U_i)$ where the sets $U_i$ are in
$\mathcal{E}$ and cover A; we have already defined $\mu(U_i)$ for such $U_i$.) It is not difficult to
see that if A is one of the sets in $\mathcal{E}$, then (1.9) reduces to the mass $\mu(A)$ specified
in the construction. The complete proof that $\mu$ satisfies all the conditions of a
measure and that its values on the sets of $\mathcal{E}$ determine its values on the Borel
sets is quite involved, and need not concern us here. Since $\mu(\mathbb{R}^n \setminus E_k) = 0$, we
have $\mu(A) = 0$ if A is an open set that does not intersect $E_k$ for some k, so the
support of $\mu$ is in $\overline{E_k}$ for all k. □

Example 1.8 

Let $\mathcal{E}_k$ denote the collection of 'binary intervals' of length $2^{-k}$ of the form
$[r2^{-k}, (r+1)2^{-k})$ where $0 \le r \le 2^k - 1$. If we take $\mu[r2^{-k}, (r+1)2^{-k}) = 2^{-k}$
in the above construction, we get that $\mu$ is Lebesgue measure on $[0, 1]$.

Note on calculation. Clearly, if $I$ is an interval in $\mathcal{E}_k$ of length $2^{-k}$ and $I_1, I_2$ are
the two subintervals of $I$ in $\mathcal{E}_{k+1}$ of length $2^{-k-1}$, we have $\mu(I) = \mu(I_1) + \mu(I_2)$,
which is (1.8). By Proposition 1.7 $\mu$ extends to a mass distribution on $[0, 1]$. We
have $\mu(I) = \text{length}(I)$ for $I$ in $\mathcal{E}$, and it may be shown that this implies that $\mu$
coincides with Lebesgue measure on any set. □
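
The additivity condition (1.8) for the binary intervals can be checked mechanically. The following Python sketch is illustrative only: the function names are hypothetical, and the use of exact rational arithmetic is an implementation choice rather than anything prescribed by the text.

```python
from fractions import Fraction

def binary_intervals(k):
    """Intervals [r 2^-k, (r+1) 2^-k) for 0 <= r <= 2^k - 1, as (left, right) pairs."""
    step = Fraction(1, 2 ** k)
    return [(r * step, (r + 1) * step) for r in range(2 ** k)]

def mass(interval):
    """Mass assigned to a binary interval: its length, as in Example 1.8."""
    left, right = interval
    return right - left

def halves(interval):
    """The two next-level binary subintervals contained in the given interval."""
    left, right = interval
    mid = (left + right) / 2
    return [(left, mid), (mid, right)]

def additivity_holds(k):
    """Check mu(I) = mu(I_1) + mu(I_2) for every interval I at level k."""
    return all(mass(i) == sum(mass(j) for j in halves(i))
               for i in binary_intervals(k))

def total_mass(k):
    """Total mass carried by the level-k intervals (should not depend on k)."""
    return sum(mass(i) for i in binary_intervals(k))
```

The total mass at every level is 1, reflecting that subdividing an interval redistributes its mass without creating or destroying any.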

We say that a property holds for almost all x, or almost everywhere (with 
respect to a measure /x) if the set for which the property fails has /x-measure 
zero. For example, we might say that almost all real numbers are irrational with 
respect to Lebesgue measure. The rational numbers $\mathbb{Q}$ are countable; they may
be listed as $x_1, x_2, \ldots$, say, so that $\mathcal{L}^1(\mathbb{Q}) = \sum_{i=1}^{\infty} \mathcal{L}^1\{x_i\} = 0$.

Although we shall usually be interested in measures in their own right, we 
shall sometimes need to integrate functions with respect to measures. There are 
technical difficulties concerning which functions can be integrated. We may get 
around these difficulties by assuming that, for $f : D \to \mathbb{R}$ a function defined on
a Borel subset $D$ of $\mathbb{R}^n$, the set $f^{-1}(-\infty, a] = \{x \in D : f(x) \le a\}$ is a Borel
set for all real numbers $a$. A very large class of functions satisfies this condition,
including all continuous functions (for which $f^{-1}(-\infty, a]$ is closed and therefore
a Borel set). We make the assumption throughout this book that all functions to 
be integrated satisfy this condition; certainly this is true of functions that are 
likely to be encountered in practice. 

To define integration we first suppose that $f : D \to \mathbb{R}$ is a simple function,
i.e. one that takes only finitely many values $a_1, \ldots, a_k$. We define the integral
with respect to the measure $\mu$ of a non-negative simple function $f$ as

$$\int f \, d\mu = \sum_{i=1}^{k} a_i \, \mu\{x : f(x) = a_i\}.$$

The integral of more general functions is defined using approximation by simple
functions. If $f : D \to \mathbb{R}$ is a non-negative function, we define its integral as

$$\int f \, d\mu = \sup\left\{ \int g \, d\mu : g \text{ is simple}, \ 0 \le g \le f \right\}.$$

To complete the definition, if $f$ takes both positive and negative values, we
let $f_+(x) = \max\{f(x), 0\}$ and $f_-(x) = \max\{-f(x), 0\}$, so that $f = f_+ - f_-$,
and define

$$\int f \, d\mu = \int f_+ \, d\mu - \int f_- \, d\mu$$

provided that $\int f_+ \, d\mu$ and $\int f_- \, d\mu$ are both finite.
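
For a measure that is a finite sum of point masses, the definition of the integral can be carried out literally. The sketch below assumes such a measure, represented as a dictionary from points to masses; this representation and the function names are illustrative assumptions, not notation from the text. On such a measure every function takes only finitely many values on the support, so the simple-function formula applies directly.

```python
from fractions import Fraction

def integrate_simple(f, mu):
    """Integral of a non-negative simple function: sum of a_i * mu{x : f(x) = a_i}.

    mu is a dict {point: mass} describing a finite sum of point masses.
    """
    level_mass = {}
    for x, m in mu.items():
        a = f(x)
        level_mass[a] = level_mass.get(a, 0) + m   # accumulates mu{x : f(x) = a}
    return sum(a * m for a, m in level_mass.items())

def integrate(f, mu):
    """Integral of a function of either sign, via f = f_+ - f_-."""
    f_plus = lambda x: max(f(x), 0)
    f_minus = lambda x: max(-f(x), 0)
    return integrate_simple(f_plus, mu) - integrate_simple(f_minus, mu)

# Hypothetical example: masses 1/2, 1/4, 1/4 at the points 0, 1, 2.
mu = {0: Fraction(1, 2), 1: Fraction(1, 4), 2: Fraction(1, 4)}
value = integrate(lambda x: x - 1, mu)
```

For this kind of measure the result agrees with the weighted sum of $f(x)$ over the point masses, which gives a quick consistency check.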

All the usual properties hold for integrals, for example,

$$\int (f + g) \, d\mu = \int f \, d\mu + \int g \, d\mu$$

and

$$\int \lambda f \, d\mu = \lambda \int f \, d\mu$$

if $\lambda$ is a scalar. We also have the monotone convergence theorem, that if
$f_k : D \to \mathbb{R}$ is an increasing sequence of non-negative functions converging
(pointwise) to $f$, then

$$\lim_{k \to \infty} \int f_k \, d\mu = \int f \, d\mu.$$

If $A$ is a Borel subset of $D$, we define integration over the set $A$ by

$$\int_A f \, d\mu = \int f \chi_A \, d\mu$$

where $\chi_A : \mathbb{R}^n \to \mathbb{R}$ is the 'indicator function' such that $\chi_A(x) = 1$ if $x$ is in $A$
and $\chi_A(x) = 0$ otherwise.

Note that, if $f(x) \ge 0$ and $\int f \, d\mu = 0$, then $f(x) = 0$ for $\mu$-almost all $x$.

As usual, integration is denoted in various ways, such as $\int f \, d\mu$, $\int f$ or
$\int f(x) \, d\mu(x)$, depending on the emphasis required. When $\mu$ is $n$-dimensional
Lebesgue measure $\mathcal{L}^n$, we usually write $\int f \, dx$ or $\int f(x) \, dx$ in place of $\int f \, d\mathcal{L}^n$.

On a couple of occasions we shall need to use Egoroff's theorem. Let $D$ be
a Borel subset of $\mathbb{R}^n$ and $\mu$ a measure with $\mu(D) < \infty$. Let $f_1, f_2, \ldots$ and $f$
be functions from $D$ to $\mathbb{R}$ such that $f_k(x) \to f(x)$ for each $x$ in $D$. Egoroff's
theorem states that for any $\delta > 0$, there is a Borel subset $E$ of $D$ such that
$\mu(D \setminus E) < \delta$ and such that the sequence $\{f_k\}$ converges uniformly to $f$ on $E$,
i.e. with $\sup_{x \in E} |f_k(x) - f(x)| \to 0$ as $k \to \infty$. For the measures that we shall
be concerned with, it may be shown that we can always take the set $E$ to
be compact.

1.4 Notes on probability theory 

For an understanding of some of the later chapters of the book, a basic knowledge 
of probability theory is necessary. We give a brief survey of the concepts needed. 
Probability theory starts with the idea of an experiment or trial; that is, an
action whose outcome is, for all practical purposes, not predetermined. Mathe- 
matically, such an experiment is described by a probability space, which has three 
components: the set of all possible outcomes of the experiment, the list of all the 
events that may occur as consequences of the experiment, and an assessment of 
likelihood of these events. For example, if a die is thrown, the possible outcomes 
are {1, 2, 3, 4, 5, 6}, the list of events includes 'a 3 is thrown', 'an even number 
is thrown', and 'at least a 4 is thrown'. For a 'fair die' it may be reasonable to 
assess the six possible outcomes as equally likely. 

The set of all possible outcomes of an experiment is called the sample space,
denoted by $\Omega$. Questions of interest concerning the outcome of an experiment
can always be phrased in terms of subsets of $\Omega$; in the above example 'is an
odd number thrown?' asks 'is the outcome in the subset $\{1, 3, 5\}$?' Associating
events dependent on the outcome of the experiment with subsets of $\Omega$ in
this way, it is natural to think of the union $A \cup B$ as 'either $A$ or $B$ occurs',
the intersection $A \cap B$ as 'both $A$ and $B$ occur', and the complement $\Omega \setminus A$ as
the event '$A$ does not occur', for any events $A$ and $B$. In general, there is a
collection $\mathcal{F}$ of subsets of $\Omega$ that particularly interest us, which we call events.
In the example of the die, $\mathcal{F}$ would normally be the collection of all subsets
of $\Omega$, but in more complicated situations a relatively small collection of subsets
might be relevant. Usually, $\mathcal{F}$ satisfies certain conditions; for example, if
the occurrence of an event interests us, then so does its non-occurrence, so
if $A$ is in $\mathcal{F}$, we would expect the complement $\Omega \setminus A$ also to be in $\mathcal{F}$. We
call a (non-empty) collection $\mathcal{F}$ of subsets of the sample space $\Omega$ an event
space if

$$\Omega \setminus A \in \mathcal{F} \quad \text{whenever } A \in \mathcal{F} \tag{1.10}$$

and

$$\bigcup_{i=1}^{\infty} A_i \in \mathcal{F} \quad \text{whenever } A_i \in \mathcal{F} \ (1 \le i < \infty). \tag{1.11}$$

It follows from these conditions that $\emptyset$ and $\Omega$ are in $\mathcal{F}$, and that $A \setminus B$ and
$\bigcap_{i=1}^{\infty} A_i$ are in $\mathcal{F}$ whenever $A$, $B$ and $A_i$ are in $\mathcal{F}$. As far as our applications
are concerned, we do not, in general, specify $\mathcal{F}$ precisely; this avoids technical
difficulties connected with the existence of suitable event spaces.

Next, we associate probabilities with the events of $\mathcal{F}$, with $P(A)$ thought of as
the probability, or likelihood, that the event $A$ occurs. We call $P$ a probability
or probability measure if $P$ assigns a number $P(A)$ to each $A$ in $\mathcal{F}$, such that the
following conditions hold:

$$0 \le P(A) \le 1 \quad \text{for all } A \in \mathcal{F} \tag{1.12}$$

$$P(\emptyset) = 0 \quad \text{and} \quad P(\Omega) = 1 \tag{1.13}$$

and, if $A_1, A_2, \ldots$ are disjoint events in $\mathcal{F}$,

$$P\left( \bigcup_{i=1}^{\infty} A_i \right) = \sum_{i=1}^{\infty} P(A_i). \tag{1.14}$$

It should seem natural for any definition of probability to satisfy these conditions. 

We call a triple $(\Omega, \mathcal{F}, P)$ a probability space if $\mathcal{F}$ is an event space of subsets
of $\Omega$ and $P$ is a probability measure defined on the sets of $\mathcal{F}$.

For the die-throwing experiment we might have $\Omega = \{1, 2, 3, 4, 5, 6\}$ with the
event space consisting of all subsets of $\Omega$, and with $P(A) = \frac{1}{6} \times$ number of elements
in $A$. This describes the 'fair die' situation with each outcome equally
likely.
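
Because the fair-die space is finite, conditions (1.10) and (1.12)-(1.14) can be verified exhaustively. The following Python sketch does so; the names `OMEGA`, `P` and `EVENTS` are illustrative, and countable additivity reduces here to additivity over disjoint pairs since only finitely many events exist.

```python
from fractions import Fraction
from itertools import chain, combinations

OMEGA = frozenset({1, 2, 3, 4, 5, 6})

def P(event):
    """Fair-die probability: (1/6) x number of elements in the event."""
    return Fraction(len(event), 6)

def all_events():
    """The event space: every subset of OMEGA."""
    items = sorted(OMEGA)
    subsets = chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))
    return {frozenset(s) for s in subsets}

EVENTS = all_events()

def closed_under_complement():
    """Condition (1.10): the complement of every event is an event."""
    return all((OMEGA - a) in EVENTS for a in EVENTS)

def additive_on_disjoint_pairs():
    """Finite form of (1.14): P(A u B) = P(A) + P(B) for disjoint A, B."""
    return all(P(a | b) == P(a) + P(b)
               for a in EVENTS for b in EVENTS if not (a & b))
```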

Often, $\Omega$ is an infinite set. For example we might have $\Omega = [0, 1]$ and think
of a random number drawn from $[0, 1]$ with the probability of the number in a
set $A$ as $P(A) = \text{length}(A)$. Here the event space might be the Borel subsets of
$[0, 1]$.

The resemblance of the definition of probability to the definition of a measure
in (1.1)-(1.4) and the use of the term probability measure is no coincidence.
Probabilities and measures may be put into the same context, with $\Omega$ corresponding
to $\mathbb{R}^n$ and with the event space corresponding to the Borel sets.

In our applications later on in the book, we shall be particularly interested in 
events (on rather large sample spaces) that are virtually certain to occur. We say 
that an event A occurs with probability 1 or almost surely if P(A) = 1. 

Sometimes, we may possess partial information about the outcome of an experiment;
for example, we might be told that the number showing on the die is even.
This leads us to reassess the probabilities of the various events. If $A$ and $B$ are in
$\mathcal{F}$ with $P(B) > 0$, the conditional probability of $A$ given $B$, denoted by $P(A|B)$,
is defined by

$$P(A|B) = \frac{P(A \cap B)}{P(B)}. \tag{1.15}$$

This is thought of as the probability of $A$ given that the event $B$ is known to
occur; as would be expected $P(B|B) = 1$. It is easy to show that $(\Omega, \mathcal{F}, P')$ is a
probability space, where $P'(A) = P(A|B)$. We also have the partition formula: if
$B_1, B_2, \ldots$ are disjoint events with $\bigcup_i B_i = \Omega$ and $P(B_i) > 0$ for all $i$, then, for
an event $A$,

$$P(A) = \sum_i P(A|B_i) P(B_i). \tag{1.16}$$

In the case of the 'fair die' experiment, if $B_1$ is the event 'an even number is
thrown', $B_2$ is 'an odd number is thrown' and $A$ is 'at least 4 is thrown', then

$$P(A|B_1) = P(\text{4 or 6 is thrown}) / P(\text{2, 4 or 6 is thrown}) = \tfrac{1}{3} / \tfrac{1}{2} = \tfrac{2}{3},$$

$$P(A|B_2) = P(\text{5 is thrown}) / P(\text{1, 3 or 5 is thrown}) = \tfrac{1}{6} / \tfrac{1}{2} = \tfrac{1}{3},$$

from which (1.16) is easily verified.
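
The verification of (1.16) for the die can be written out as a short computation. This is a minimal sketch; the helper names `P` and `cond` are chosen here for illustration.

```python
from fractions import Fraction

def P(event):
    """Fair-die probability of a set of faces."""
    return Fraction(len(event), 6)

def cond(a, b):
    """Conditional probability P(A|B) = P(A n B) / P(B), as in (1.15)."""
    return P(a & b) / P(b)

B1 = frozenset({2, 4, 6})   # an even number is thrown
B2 = frozenset({1, 3, 5})   # an odd number is thrown
A = frozenset({4, 5, 6})    # at least 4 is thrown

# Right-hand side of the partition formula (1.16).
partition_total = cond(A, B1) * P(B1) + cond(A, B2) * P(B2)
```

Here `partition_total` works out as $\tfrac{2}{3} \cdot \tfrac{1}{2} + \tfrac{1}{3} \cdot \tfrac{1}{2} = \tfrac{1}{2}$, which is $P(A)$.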
We think of two events as independent if the occurrence of one does not affect
the probability that the other occurs, i.e. if $P(A|B) = P(A)$ and $P(B|A) = P(B)$.
Using (1.15), we are led to make the definition that two events $A$ and $B$ in a
probability space are independent if

$$P(A \cap B) = P(A) P(B). \tag{1.17}$$

More generally, an arbitrary collection of events is independent if for every finite
subcollection $\{A_k : k \in J\}$ we have

$$P\left( \bigcap_{k \in J} A_k \right) = \prod_{k \in J} P(A_k). \tag{1.18}$$

In the die example, it is easy to see that 'a throw of at least 5' and 'an even
number is thrown' are independent events, but 'a throw of at least 4' and 'an
even number is thrown' are not.
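
The two claims about the die can be checked directly against (1.18). The sketch below tests the product rule on every subcollection of at least two events; the function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations

def P(event):
    """Fair-die probability of a set of faces."""
    return Fraction(len(event), 6)

def independent(events):
    """Check the product rule (1.18) on every subcollection of size >= 2."""
    for r in range(2, len(events) + 1):
        for sub in combinations(events, r):
            intersection = frozenset.intersection(*sub)
            product = Fraction(1)
            for e in sub:
                product *= P(e)
            if P(intersection) != product:
                return False
    return True

AT_LEAST_5 = frozenset({5, 6})
AT_LEAST_4 = frozenset({4, 5, 6})
EVEN = frozenset({2, 4, 6})
```

For 'at least 5' and 'even' the intersection is $\{6\}$, with probability $\tfrac{1}{6} = \tfrac{1}{3} \cdot \tfrac{1}{2}$; for 'at least 4' and 'even' the intersection $\{4, 6\}$ has probability $\tfrac{1}{3}$, whereas the product is $\tfrac{1}{4}$.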

The idea of a random variable and its expectation (or average or mean) is
fundamental to probability theory. Essentially, a random variable $X$ is a
real-valued function on a sample space. In the die example, $X$ might represent the
score on the die. Alternatively it might represent the reward for throwing a
particular number, for example $X(\omega) = 0$ if $\omega = 1, 2, 3$, or $4$, $X(5) = 1$ and
$X(6) = 2$. The outcome of an experiment determines a value of the random
variable. The expectation of the random variable is the average of these values
weighted according to the likelihood of each outcome.

The precise definition of a random variable requires a little care. We say that $X$
is a random variable on a probability space $(\Omega, \mathcal{F}, P)$ if $X : \Omega \to \mathbb{R}$ is a function
such that $X^{-1}((-\infty, a])$ is an event in $\mathcal{F}$ for each real number $a$; in other words,
the set of $\omega$ in $\Omega$ with $X(\omega) \le a$ is in the event space. This condition is equivalent
to saying that $X^{-1}(E)$ is in $\mathcal{F}$ for any Borel set $E$. In particular, for any such $E$ the
probability that the random variable $X$ takes a value in $E$, i.e. $P(\{\omega : X(\omega) \in E\})$,
is defined. It may be shown that $P(\{\omega : X(\omega) \in E\})$ is determined for all Borel
sets $E$ from a knowledge of $P(\{\omega : X(\omega) \le a\})$ for each real number $a$. Note that
it is usual to abbreviate expressions such as $P(\{\omega : X(\omega) \in E\})$ to $P(X \in E)$.

It is not difficult to show that if $X$ and $Y$ are random variables on $(\Omega, \mathcal{F}, P)$
and $\lambda$ is a real number, then $X + Y$, $X - Y$, $XY$ and $\lambda X$ are all random variables
(these are defined in the obvious way, for example $(X + Y)(\omega) = X(\omega) + Y(\omega)$
for each $\omega \in \Omega$). Moreover, if $X_1, X_2, \ldots$ is a sequence of random variables with
$X_k(\omega)$ increasing and bounded for each $\omega$, then $\lim_{k \to \infty} X_k$ is a random variable.

A collection of random variables $\{X_k\}$ is independent if, for any Borel sets
$E_k$, the events $\{(X_k \in E_k)\}$ are independent in the sense of (1.18); that is if, for
every finite set of indices $J$,

$$P(X_k \in E_k \text{ for all } k \in J) = \prod_{k \in J} P(X_k \in E_k).$$
Intuitively, $X$ and $Y$ are independent if the probability of $Y$ taking any
particular value is unaffected by a knowledge of the value of $X$. Consider the
probability space representing two successive throws of a die, with sample space
$\{(x, y) : x, y = 1, 2, \ldots, 6\}$ and probability measure $P$ defined by $P\{(x, y)\} = \frac{1}{36}$
for each pair $(x, y)$. If $X$ and $Y$ are the random variables given by the scores
on successive throws, then $X$ and $Y$ are independent, modelling the assumption
that one throw does not affect the other. However, $X$ and $X + Y$ are not
independent; this reflects that the bigger the score for the first throw, the greater the
chance of a high total score.
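
The two-throw example can be tested by enumeration. The sketch below relies on one simplification that should be read as an assumption of the code: for random variables taking finitely many values, it is enough to check the product rule on single values, since any event $(X \in E)$ is a finite disjoint union of events $(X = x)$. The names are illustrative.

```python
from fractions import Fraction
from itertools import product

# Sample space for two successive throws, each pair having probability 1/36.
OUTCOMES = list(product(range(1, 7), repeat=2))

def P(predicate):
    """Probability of the set of outcomes satisfying the predicate."""
    favourable = sum(1 for w in OUTCOMES if predicate(w))
    return Fraction(favourable, len(OUTCOMES))

def X(w):
    return w[0]            # score on the first throw

def Y(w):
    return w[1]            # score on the second throw

def total(w):
    return w[0] + w[1]     # the random variable X + Y

def independent_rvs(U, V):
    """Check P(U = u and V = v) = P(U = u) P(V = v) for all values u, v."""
    u_values = {U(w) for w in OUTCOMES}
    v_values = {V(w) for w in OUTCOMES}
    return all(
        P(lambda w: U(w) == u and V(w) == v)
        == P(lambda w: U(w) == u) * P(lambda w: V(w) == v)
        for u in u_values for v in v_values)
```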

The formal definition of the expectation of a random variable is analogous
to the definition of the integral of a function; indeed, expectation is really the
integral of the random variable with respect to the probability measure. Let $X$ be
a random variable on a probability space $(\Omega, \mathcal{F}, P)$. First suppose that $X(\omega) \ge 0$
for all $\omega$ in $\Omega$, and that $X$ takes only finitely many values $x_1, \ldots, x_k$; we call such
a random variable simple. We define the expectation, mean or average $E(X)$ of
$X$ as

$$E(X) = \sum_{i=1}^{k} x_i P(X = x_i). \tag{1.19}$$

The expectation of an arbitrary random variable is defined using approximation
by simple random variables. Thus for a non-negative random variable $X$

$$E(X) = \sup\{ E(Y) : Y \text{ is a simple random variable with } 0 \le Y(\omega) \le X(\omega) \text{ for all } \omega \in \Omega \}.$$

Lastly, if $X$ takes both positive and negative values, we let $X_+ = \max\{X, 0\}$ and
$X_- = \max\{-X, 0\}$, so that $X = X_+ - X_-$, and define

$$E(X) = E(X_+) - E(X_-)$$

provided that both $E(X_+) < \infty$ and $E(X_-) < \infty$.

The random variable $X$ representing the score of a fair die is a simple random
variable, since $X(\omega)$ takes just the values $1, \ldots, 6$. Thus

$$E(X) = \sum_{i=1}^{6} i \times \tfrac{1}{6} = 3\tfrac{1}{2}.$$



Expectation satisfies certain basic properties, analogous to those for the integral.
If $X_1, X_2, \ldots$ are random variables then

$$E(X_1 + X_2) = E(X_1) + E(X_2)$$

and, more generally,

$$E\left( \sum_{i=1}^{k} X_i \right) = \sum_{i=1}^{k} E(X_i).$$

If $\lambda$ is a constant

$$E(\lambda X) = \lambda E(X)$$

and if the sequence of non-negative random variables $X_1, X_2, \ldots$ is increasing
with $X = \lim_{k \to \infty} X_k$ a (finite) random variable, then

$$\lim_{k \to \infty} E(X_k) = E(X).$$

Provided that $X_1$ and $X_2$ are independent, we also have

$$E(X_1 X_2) = E(X_1) E(X_2).$$

Thus if $X_k$ represents the $k$th throw of a fair die in a sequence of throws,
the expectation of the sum of the first $k$ throws is $E(X_1 + \cdots + X_k) = E(X_1) +
\cdots + E(X_k) = 3\tfrac{1}{2} \times k$.

We define the conditional expectation $E(X|B)$ of $X$ given an event $B$ with
$P(B) > 0$ in a similar way, but starting with

$$E(X|B) = \sum_{i=1}^{k} x_i P(X = x_i | B) \tag{1.20}$$

in place of (1.19). We get a partition formula resembling (1.16)

$$E(X) = \sum_i E(X|B_i) P(B_i) \tag{1.21}$$

where $B_1, B_2, \ldots$ are disjoint events with $\bigcup_i B_i = \Omega$ and $P(B_i) > 0$.

It is often useful to have an indication of the fluctuation of a random variable 
across a sample space. Thus we introduce the variance of the random variable 
X as 

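$$\operatorname{var}(X) = E((X - E(X))^2) = E(X^2) - E(X)^2$$

by a simple calculation. Using the properties of expectation, we get

$$\operatorname{var}(\lambda X) = \lambda^2 \operatorname{var}(X)$$

for any real number $\lambda$, and

$$\operatorname{var}(X + Y) = \operatorname{var}(X) + \operatorname{var}(Y)$$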

provided that X and Y are independent. 
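
These identities are easy to confirm for the fair die using exact arithmetic. In the sketch below a distribution is represented as a dictionary from values to probabilities; this representation and the helper names are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import product

DIE = {x: Fraction(1, 6) for x in range(1, 7)}

def E(dist, g=lambda x: x):
    """Expectation of g(X) for a finitely-valued X, following (1.19)."""
    return sum(g(x) * p for x, p in dist.items())

def var(dist):
    """Variance in the form E(X^2) - E(X)^2."""
    return E(dist, lambda x: x * x) - E(dist) ** 2

def var_central(dist):
    """Variance in the form E((X - E(X))^2)."""
    m = E(dist)
    return E(dist, lambda x: (x - m) ** 2)

def scaled(dist, lam):
    """Distribution of lam * X (lam nonzero, so values stay distinct)."""
    return {lam * x: p for x, p in dist.items()}

def independent_sum(d1, d2):
    """Distribution of X + Y when X and Y are independent."""
    out = {}
    for (x, p), (y, q) in product(d1.items(), d2.items()):
        out[x + y] = out.get(x + y, 0) + p * q
    return out
```

For the die this gives expectation $3\tfrac{1}{2}$ and variance $\tfrac{35}{12} = 2\tfrac{11}{12}$.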
If the probability distribution of a random variable is given by an integral, i.e.

$$P(X \le x) = \int_{-\infty}^{x} f(u) \, du \tag{1.22}$$

the function $f$ is called the probability density function for $X$. It may be shown
from the definition of expectation that

$$E(X) = \int_{-\infty}^{\infty} u f(u) \, du$$

and

$$E(X^2) = \int_{-\infty}^{\infty} u^2 f(u) \, du$$

which allows $\operatorname{var}(X) = E(X^2) - E(X)^2$ to be calculated.

Note that the density function tells us about the distribution of the random 
variable X without reference to the underlying probability space, which, for 
many purposes, is irrelevant. We may express the probability that X belongs to 
any Borel set E in terms of the density function as 



$$P(X \in E) = \int_E f(u) \, du.$$




We say that a random variable $X$ has uniform distribution on the interval
$[a, b]$ if

$$P(X \le x) = \frac{1}{b - a} \int_a^x du \quad (a < x < b). \tag{1.23}$$

Thus the probability of $X$ lying in a subinterval of $[a, b]$ is proportional to the
length of the interval. In this case, we get that $E(X) = \tfrac{1}{2}(a + b)$ and $\operatorname{var}(X) =
\tfrac{1}{12}(b - a)^2$.

A random variable $X$ has normal or Gaussian distribution of mean $m$ and
variance $\sigma^2$ if

$$P(X \le x) = \frac{1}{\sigma \sqrt{2\pi}} \int_{-\infty}^{x} \exp\left( \frac{-(u - m)^2}{2\sigma^2} \right) du. \tag{1.24}$$

It may be verified by integration that $E(X) = m$ and $\operatorname{var}(X) = \sigma^2$. If $X_1$ and $X_2$
are independent normally distributed random variables of means $m_1$ and $m_2$ and
variances $\sigma_1^2$ and $\sigma_2^2$ respectively, then $X_1 + X_2$ is normal with mean $m_1 + m_2$
and variance $\sigma_1^2 + \sigma_2^2$, and $\lambda X_1$ is normal with mean $\lambda m_1$ and variance $\lambda^2 \sigma_1^2$, for
any real number $\lambda$.

If we throw a fair die a large number of times, we might expect the average
score thrown to be very close to $3\tfrac{1}{2}$, the expectation or mean outcome of each
throw. Moreover, the larger the number of throws, the closer the average should
be to the mean. This 'law of averages' is made precise as the strong law of
large numbers.

Let $(\Omega, \mathcal{F}, P)$ be a probability space. Let $X_1, X_2, \ldots$ be random variables that
are independent and that have identical distribution (i.e. for every set $E$, $P(X_i \in E)$
is the same for all $i$), with expectation $m$ and variance $\sigma^2$, both assumed finite.
For each $k$ we may form the random variable $S_k = X_1 + \cdots + X_k$, so that the
random variable $(1/k) S_k$ is the average of the first $k$ trials. The strong law of
large numbers states that, with probability 1,

$$\lim_{k \to \infty} \frac{1}{k} S_k = m. \tag{1.25}$$

We can also say a surprising amount about the distribution of the random
variable $S_k$ when $k$ is large. It may be shown that $S_k$ has approximately the
normal distribution with mean $km$ and variance $k\sigma^2$. This is the content of the
central limit theorem, which states that, for every real number $x$,

$$P\left( \frac{S_k - km}{\sigma \sqrt{k}} \le x \right) \to \int_{-\infty}^{x} \frac{1}{\sqrt{2\pi}} \exp\left( -\tfrac{1}{2} u^2 \right) du \quad \text{as } k \to \infty. \tag{1.26}$$

An important aspect of the normal distribution now becomes clear: it is the form
of distribution approached by sums of a large number of independent identically
distributed random variables.

We may apply these results to the experiment consisting of an infinite sequence
of die throws. Let $\Omega$ be the set of all infinite sequences $\{\omega = (\omega_1, \omega_2, \ldots) :
\omega_i = 1, 2, \ldots, 6\}$ (we think of $\omega_k$ as the outcome of the $k$th throw). It is possible
to define an event space $\mathcal{F}$ and probability measure $P$ in such a way that for any
given $k$ and sequence $\omega_1, \ldots, \omega_k$ ($\omega_i = 1, 2, \ldots, 6$), the event 'the first $k$ throws
are $\omega_1, \ldots, \omega_k$' is in $\mathcal{F}$ and has probability $(\tfrac{1}{6})^k$. Let $X_k$ be the random variable
given by the outcome of the $k$th throw, so that $X_k(\omega) = \omega_k$. It is easy to see
that the $X_k$ are independent and identically distributed, with mean $m = 3\tfrac{1}{2}$ and
variance $2\tfrac{11}{12}$. The strong law of large numbers tells us that, with probability 1, the
average of the first $k$ throws, $S_k / k$, converges to $3\tfrac{1}{2}$ as $k$ tends to infinity, and the
central limit theorem tells us that, when $k$ is large, the sum $S_k$ is approximately
normally distributed, with mean $3\tfrac{1}{2} \times k$ and variance $2\tfrac{11}{12} \times k$. Thus if we repeat
the experiment of throwing $k$ dice a large number of times, the sum of the $k$
throws will have a distribution close to the normal distribution, in the sense
of (1.26).
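
The quality of the normal approximation for die sums can be examined without simulation, since the exact distribution of $S_k$ is obtainable by repeated convolution. The following Python sketch compares the left-hand side of (1.26) with the standard normal distribution function; the function names and the choice $k = 30$ in the usage note are illustrative assumptions.

```python
from fractions import Fraction
from math import erf, sqrt

def sum_distribution(k):
    """Exact distribution of S_k for k independent fair dice, by convolution."""
    dist = {0: Fraction(1)}
    for _ in range(k):
        new = {}
        for total, p in dist.items():
            for face in range(1, 7):
                new[total + face] = new.get(total + face, 0) + p * Fraction(1, 6)
        dist = new
    return dist

def standard_normal_cdf(x):
    """The right-hand side of (1.26), expressed through the error function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))

def standardized_cdf(k, x):
    """P((S_k - k m) / (sigma sqrt(k)) <= x), computed from the exact distribution."""
    m, variance = Fraction(7, 2), Fraction(35, 12)
    threshold = float(k * m) + x * sqrt(float(variance) * k)
    dist = sum_distribution(k)
    return float(sum(p for total, p in dist.items() if total <= threshold))
```

For example, comparing `standardized_cdf(30, x)` with `standard_normal_cdf(x)` at a few values of $x$ shows agreement to within a few hundredths. Because $S_k$ takes only integer values, its distribution function has jumps, so exact agreement at finite $k$ should not be expected.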



1.5 Notes and references 

The material outlined in this chapter is covered at various levels of sophistication 
in numerous undergraduate mathematical texts. Almost any book on mathematical 
analysis, for example Rudin (1964) or Apostol (1974), contains the basic theory 
of sets and functions. A thorough treatment of measure and probability theory 
may be found in Kingman and Taylor (1966), Billingsley (1995) and Edgar
(1998). For probability theory, the book by Grimmett and Stirzaker (1992) may 
be found helpful. 



Exercises 

The following exercises do no more than emphasize some of the many facts that have 
been mentioned in this chapter. 

1.1 Verify that for $x, y, z \in \mathbb{R}^n$, (i) $|x + y| \le |x| + |y|$, (ii) $|x - y| \ge \big| |x| - |y| \big|$,
(iii) $|x - y| \le |x - z| + |z - y|$.

1.2 Show from the definition of $\delta$-neighbourhood that $A_{\delta + \delta'} = (A_\delta)_{\delta'}$.

1.3 Show that a (non-empty) set is bounded if and only if it is contained in some ball 
B(0, r) with centre the origin. 

1.4 Determine which of the following sets are open and which are closed. In each
case determine the interior and closure of the set. (i) A non-empty finite set $A$,
(ii) the interval $(0, 1)$, (iii) the interval $[0, 1]$, (iv) the interval $[0, 1)$, (v) the set
$\{0, 1, \tfrac{1}{2}, \tfrac{1}{3}, \tfrac{1}{4}, \ldots\}$.

1.5 Show that the middle third Cantor set, figure 0.1, is compact and totally discon- 
nected. What is its interior, closure and boundary? 

1.6 Show that the union of any collection of open subsets of $\mathbb{R}^n$ is open and that the
intersection of any finite collection of open sets is open. Show that a subset of $\mathbb{R}^n$
is closed if and only if its complement is open and hence deduce the corresponding
result for unions and intersections of closed sets.

1.7 Show that if $A_1 \supset A_2 \supset \cdots$ is a decreasing sequence of non-empty compact subsets
of $\mathbb{R}^n$ then $\bigcap_{k=1}^{\infty} A_k$ is a non-empty compact set.

1.8 Show that the half-open interval $\{x \in \mathbb{R} : 0 \le x < 1\}$ is a Borel subset of $\mathbb{R}$.

1.9 Let F be the set of numbers in [0, 1] whose decimal expansions contain the digit 5 
infinitely many times. Show that F is a Borel set. 

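1.10 Show that the coordinate transformation of the plane

$$\begin{pmatrix} x_1 \\ x_2 \end{pmatrix} \mapsto \begin{pmatrix} c\cos\theta & -c\sin\theta \\ c\sin\theta & c\cos\theta \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \end{pmatrix} + \begin{pmatrix} a_1 \\ a_2 \end{pmatrix}$$

is a similarity of ratio $c$, and describe the transformation geometrically.

1.11 Find $\liminf_{x \to 0} f(x)$ and $\limsup_{x \to 0} f(x)$ where $f : \mathbb{R}^+ \to \mathbb{R}$ is given by: (i) $\sin(x)$;
(ii) $\sin(1/x)$; (iii) $x^2 + (3 + x)\sin(1/x)$.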

1.12 Let $f, g : [0, 1] \to \mathbb{R}$ be Lipschitz functions. Show that the functions defined on
$[0, 1]$ by $f(x) + g(x)$ and $f(x)g(x)$ are also Lipschitz.

1.13 Let $f : \mathbb{R} \to \mathbb{R}$ be differentiable with $|f'(x)| \le c$ for all $x$. Show, using the mean
value theorem, that $f$ is a Lipschitz function.

1.14 Show that every Lipschitz function $f : \mathbb{R} \to \mathbb{R}$ is continuous.

1.15 Let $f : \mathbb{R} \to \mathbb{R}$ be given by $f(x) = x^2 + x$. Find (i) $f^{-1}(2)$, (ii) $f^{-1}(-2)$,
(iii) $f^{-1}([2, 6])$.

1.16 Show that $f(x) = x^2$ is Lipschitz on $[0, 2]$, bi-Lipschitz on $[1, 2]$, and not Lipschitz
on $\mathbb{R}$.

1.17 Show that if $E$ is a compact subset of $\mathbb{R}^n$ and $f : E \to \mathbb{R}^n$ is continuous, then $f(E)$
is compact.

1.18 Let $A_1, A_2, \ldots$ be a decreasing sequence of Borel subsets of $\mathbb{R}^n$ and let $A =
\bigcap_{k=1}^{\infty} A_k$. If $\mu$ is a measure on $\mathbb{R}^n$ with $\mu(A_1) < \infty$, show using (1.6) that $\mu(A_k) \to
\mu(A)$ as $k \to \infty$.

1.19 Show that the point mass concentrated at a (see Example 1.2) is a measure. 

1.20 Show how to define a mass distribution on the middle third Cantor set, figure 0.1, 
in as uniform a way as possible. 

1.21 Verify that Lebesgue measure satisfies (1.1), (1.2) and (1.3). 

1.22 Let $f : [0, 1] \to \mathbb{R}$ be a continuous function. For $A$ a subset of $\mathbb{R}^2$ define $\mu(A) =
\mathcal{L}\{x : (x, f(x)) \in A\}$, where $\mathcal{L}$ is Lebesgue measure. Show that $\mu$ is a mass
distribution on $\mathbb{R}^2$ supported by the graph of $f$.

1.23 Let $D$ be a Borel subset of $\mathbb{R}^n$ and let $\mu$ be a measure on $D$ with $\mu(D) < \infty$. Let
$f_k : D \to \mathbb{R}$ be a sequence of functions such that $f_k(x) \to f(x)$ for all $x$ in $D$.
Prove Egoroff's theorem: that given $\varepsilon > 0$ there exists a Borel subset $A$ of $D$ with
$\mu(D \setminus A) < \varepsilon$ such that $f_k(x)$ converges to $f(x)$ uniformly for $x$ in $A$.

1.24 Prove that if $\mu$ is a measure on $D$ and $f : D \to \mathbb{R}$ satisfies $f(x) \ge 0$ for all $x$ in
$D$ and $\int_D f \, d\mu = 0$ then $f(x) = 0$ for $\mu$-almost all $x$.

1.25 If $X$ is a random variable show that $E((X - E(X))^2) = E(X^2) - E(X)^2$ (these
numbers equalling the variance of $X$).

1.26 Verify that if $X$ has the uniform distribution on $[a, b]$ (see (1.23)) then $E(X) =
\tfrac{1}{2}(a + b)$ and $\operatorname{var}(X) = (b - a)^2 / 12$.

1.27 Let $A_1, A_2, \ldots$ be a sequence of independent events in some probability space such
that $P(A_k) = p$ for all $k$, where $0 < p < 1$. Let $N_k$ be the random variable defined
by taking $N_k(\omega)$ to equal the number of $i$ with $1 \le i \le k$ for which $\omega \in A_i$. Use
the strong law of large numbers to show that, with probability 1, $N_k / k \to p$ as
$k \to \infty$. Deduce that the proportion of successes in a sequence of independent trials
converges to the probability of success of each trial.

1.28 A fair die is thrown 6000 times. Use the central limit theorem to estimate the 
probability that at least 1050 sixes are thrown. (A numerical method will be needed 
if the integral obtained is to be evaluated.) 



Chapter 2 Hausdorff measure 
and dimension 



The notion of dimension is central to fractal geometry. Roughly, dimension indi- 
cates how much space a set occupies near to each of its points. Of the wide variety 
of 'fractal dimensions' in use, the definition of Hausdorff, based on a construction
of Carathéodory, is the oldest and probably the most important. Hausdorff
dimension has the advantage of being defined for any set, and is mathematically 
convenient, as it is based on measures, which are relatively easy to manipulate. 
A major disadvantage is that in many cases it is hard to calculate or to estimate 
by computational methods. However, for an understanding of the mathematics 
of fractals, familiarity with Hausdorff measure and dimension is essential. 



2.1 Hausdorff measure 

Recall that if $U$ is any non-empty subset of $n$-dimensional Euclidean space, $\mathbb{R}^n$,
the diameter of $U$ is defined as $|U| = \sup\{|x - y| : x, y \in U\}$, i.e. the greatest
distance apart of any pair of points in $U$. If $\{U_i\}$ is a countable (or finite) collection
of sets of diameter at most $\delta$ that cover $F$, i.e. $F \subset \bigcup_{i=1}^{\infty} U_i$ with $0 \le |U_i| \le \delta$
for each $i$, we say that $\{U_i\}$ is a $\delta$-cover of $F$.

Suppose that $F$ is a subset of $\mathbb{R}^n$ and $s$ is a non-negative number. For any
$\delta > 0$ we define

$$\mathcal{H}^s_\delta(F) = \inf\left\{ \sum_{i=1}^{\infty} |U_i|^s : \{U_i\} \text{ is a } \delta\text{-cover of } F \right\}. \tag{2.1}$$

Thus we look at all covers of $F$ by sets of diameter at most $\delta$ and seek to minimize
the sum of the $s$th powers of the diameters (figure 2.1). As $\delta$ decreases, the class
of permissible covers of $F$ in (2.1) is reduced. Therefore, the infimum $\mathcal{H}^s_\delta(F)$
increases, and so approaches a limit as $\delta \to 0$. We write

$$\mathcal{H}^s(F) = \lim_{\delta \to 0} \mathcal{H}^s_\delta(F). \tag{2.2}$$

Figure 2.1 A set $F$ and two possible $\delta$-covers for $F$. The infimum of $\sum |U_i|^s$ over all
such $\delta$-covers $\{U_i\}$ gives $\mathcal{H}^s_\delta(F)$



This limit exists for any subset $F$ of $\mathbb{R}^n$, though the limiting value can be
(and usually is) $0$ or $\infty$. We call $\mathcal{H}^s(F)$ the $s$-dimensional Hausdorff measure
of $F$.

With a certain amount of effort, $\mathcal{H}^s$ may be shown to be a measure; see
section 1.3. It is straightforward to show that $\mathcal{H}^s(\emptyset) = 0$, that if $E$ is contained
in $F$ then $\mathcal{H}^s(E) \le \mathcal{H}^s(F)$, and that if $\{F_i\}$ is any countable collection of
sets, then

$$\mathcal{H}^s\left( \bigcup_{i=1}^{\infty} F_i \right) \le \sum_{i=1}^{\infty} \mathcal{H}^s(F_i). \tag{2.3}$$

It is rather harder to show that there is equality in (2.3) if the $\{F_i\}$ are disjoint
Borel sets.

Hausdorff measures generalize the familiar ideas of length, area, volume,
etc. It may be shown that, for subsets of $\mathbb{R}^n$, $n$-dimensional Hausdorff measure
is, to within a constant multiple, just $n$-dimensional Lebesgue measure,
i.e. the usual $n$-dimensional volume. More precisely, if $F$ is a Borel subset of
$\mathbb{R}^n$, then

$$\mathcal{H}^n(F) = c_n^{-1} \operatorname{vol}^n(F) \tag{2.4}$$

where $c_n$ is the volume of an $n$-dimensional ball of diameter 1, so that $c_n =
\pi^{n/2} / 2^n (n/2)!$ if $n$ is even and $c_n = \pi^{(n-1)/2} ((n-1)/2)! / n!$ if $n$ is odd. Similarly,
for 'nice' lower-dimensional subsets of $\mathbb{R}^n$, we have that $\mathcal{H}^0(F)$ is the number
of points in $F$; $\mathcal{H}^1(F)$ gives the length of a smooth curve $F$; $\mathcal{H}^2(F) = (4/\pi) \times
\operatorname{area}(F)$ if $F$ is a smooth surface; $\mathcal{H}^3(F) = (6/\pi) \times \operatorname{vol}(F)$; and $\mathcal{H}^m(F) =
c_m^{-1} \times \operatorname{vol}^m(F)$ if $F$ is a smooth $m$-dimensional submanifold of $\mathbb{R}^n$ (i.e. an
$m$-dimensional surface in the classical sense).

Figure 2.2 Scaling sets by a factor $\lambda$ increases length by a factor $\lambda$, area by a factor $\lambda^2$,
and $s$-dimensional Hausdorff measure by a factor $\lambda^s$
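
The even and odd expressions for the constant $c_n$ in (2.4) can be compared numerically with the standard gamma-function formula for the volume of an $n$-dimensional ball of radius $r$, namely $\pi^{n/2} r^n / \Gamma(n/2 + 1)$. That formula is a known fact brought in here for the check rather than something stated in the text, and the function names in this sketch are illustrative.

```python
from math import factorial, gamma, isclose, pi

def ball_volume(n, diameter=1.0):
    """Volume of an n-dimensional ball via the gamma-function formula."""
    r = diameter / 2.0
    return pi ** (n / 2.0) * r ** n / gamma(n / 2.0 + 1.0)

def c_n(n):
    """The constant c_n of (2.4), using the even/odd factorial expressions."""
    if n % 2 == 0:
        half = n // 2
        return pi ** half / (2 ** n * factorial(half))
    half = (n - 1) // 2
    return pi ** half * factorial(half) / factorial(n)

def formulas_agree(max_n=10):
    """Compare the two expressions for n = 1, ..., max_n."""
    return all(isclose(ball_volume(n), c_n(n), rel_tol=1e-9)
               for n in range(1, max_n + 1))
```

In particular $c_2 = \pi/4$ and $c_3 = \pi/6$, which accounts for the factors $4/\pi$ and $6/\pi$ appearing for smooth surfaces and for volumes.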

The scaling properties of length, area and volume are well known. On magnification
by a factor $\lambda$, the length of a curve is multiplied by $\lambda$, the area of a plane
region is multiplied by $\lambda^2$ and the volume of a 3-dimensional object is multiplied
by $\lambda^3$. As might be anticipated, $s$-dimensional Hausdorff measure scales with
a factor $\lambda^s$ (figure 2.2). Such scaling properties are fundamental to the theory
of fractals.

Scaling property 2.1

Let $S$ be a similarity transformation of scale factor $\lambda > 0$. If $F \subset \mathbb{R}^n$, then

$$\mathcal{H}^s(S(F)) = \lambda^s \mathcal{H}^s(F). \tag{2.5}$$
Proof. If $\{U_i\}$ is a $\delta$-cover of $F$ then $\{S(U_i)\}$ is a $\lambda\delta$-cover of $S(F)$, so

$$\sum_i |S(U_i)|^s = \lambda^s \sum_i |U_i|^s$$

so

$$\mathcal{H}^s_{\lambda\delta}(S(F)) \le \lambda^s \mathcal{H}^s_\delta(F)$$

on taking the infimum. Letting $\delta \to 0$ gives that $\mathcal{H}^s(S(F)) \le \lambda^s \mathcal{H}^s(F)$. Replacing
$S$ by $S^{-1}$, and so $\lambda$ by $1/\lambda$, and $F$ by $S(F)$ gives the opposite inequality required.

□
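
The key step of the proof, that a similarity multiplies every diameter by $\lambda$ and hence multiplies the sum of $s$th powers by $\lambda^s$, can be illustrated numerically. In the sketch below the covering sets are stood in for by small finite point sets in the plane, and the similarity is a rotation through `theta`, a scaling by `lam` and a translation by `shift`; all of these names and the sample data in the usage note are hypothetical.

```python
from itertools import combinations
from math import cos, dist, isclose, sin

def diameter(points):
    """Greatest distance between any pair of points (0 for a single point)."""
    return max((dist(p, q) for p, q in combinations(points, 2)), default=0.0)

def similarity(point, lam, theta, shift):
    """Rotate through theta, scale by lam, then translate by shift."""
    x, y = point
    return (lam * (x * cos(theta) - y * sin(theta)) + shift[0],
            lam * (x * sin(theta) + y * cos(theta)) + shift[1])

def cover_sum(cover, s):
    """Sum of the s-th powers of the diameters of the sets in the cover."""
    return sum(diameter(u) ** s for u in cover)

def image_cover(cover, lam, theta, shift):
    """Apply the similarity to every point of every set in the cover."""
    return [[similarity(p, lam, theta, shift) for p in u] for u in cover]
```

For instance, with `cover = [[(0, 0), (1, 0), (0, 1)], [(2, 2), (2, 3)]]`, `lam = 2.5` and `s = 1.3`, the value `cover_sum(image_cover(cover, 2.5, 0.7, (3, -1)), 1.3)` agrees with `2.5 ** 1.3 * cover_sum(cover, 1.3)` up to rounding.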

A similar argument gives the following basic estimate of the effect of more 
general transformations on the Hausdorff measures of sets. 

Proposition 2.2 

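Let $F \subset \mathbb{R}^n$ and $f : F \to \mathbb{R}^m$ be a mapping such that

$$|f(x) - f(y)| \le c|x - y|^\alpha \quad (x, y \in F) \tag{2.6}$$

for constants $c > 0$ and $\alpha > 0$. Then for each $s$

$$\mathcal{H}^{s/\alpha}(f(F)) \le c^{s/\alpha} \mathcal{H}^s(F). \tag{2.7}$$

Proof. If $\{U_i\}$ is a $\delta$-cover of $F$, then, since $|f(F \cap U_i)| \le c|F \cap U_i|^\alpha \le c|U_i|^\alpha$, it
follows that $\{f(F \cap U_i)\}$ is an $\varepsilon$-cover of $f(F)$, where $\varepsilon = c\delta^\alpha$. Thus
$\sum_i |f(F \cap U_i)|^{s/\alpha} \le c^{s/\alpha} \sum_i |U_i|^s$, so that $\mathcal{H}^{s/\alpha}_\varepsilon(f(F)) \le c^{s/\alpha} \mathcal{H}^s_\delta(F)$. As $\delta \to 0$,
so $\varepsilon \to 0$, giving (2.7). □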

Condition (2.6) is known as a Hölder condition of exponent $\alpha$; such a condition
implies that $f$ is continuous. Particularly important is the case $\alpha = 1$, i.e.

$$|f(x) - f(y)| \le c|x - y| \quad (x, y \in F) \tag{2.8}$$

when $f$ is called a Lipschitz mapping, and

$$\mathcal{H}^s(f(F)) \le c^s \mathcal{H}^s(F). \tag{2.9}$$

In particular (2.9) holds for any differentiable function with bounded derivative;
such a function is necessarily Lipschitz as a consequence of the mean value
theorem. If $f$ is an isometry, i.e. $|f(x) - f(y)| = |x - y|$, then $\mathcal{H}^s(f(F)) =
\mathcal{H}^s(F)$. Thus, Hausdorff measures are translation invariant (i.e. $\mathcal{H}^s(F + z) =
\mathcal{H}^s(F)$, where $F + z = \{x + z : x \in F\}$), and rotation invariant, as would
certainly be expected.
2.2 Hausdorff dimension

Returning to equation (2.1) it is clear that for any given set $F \subset \mathbb{R}^n$ and $\delta < 1$,
$\mathcal{H}^s_\delta(F)$ is non-increasing with $s$, so by (2.2) $\mathcal{H}^s(F)$ is also non-increasing. In
fact, rather more is true: if $t > s$ and $\{U_i\}$ is a $\delta$-cover of $F$ we have

$$\sum_i |U_i|^t \le \sum_i |U_i|^{t-s} |U_i|^s \le \delta^{t-s} \sum_i |U_i|^s \tag{2.10}$$

so, taking infima, $\mathcal{H}^t_\delta(F) \le \delta^{t-s} \mathcal{H}^s_\delta(F)$. Letting $\delta \to 0$ we see that if $\mathcal{H}^s(F) < \infty$
then $\mathcal{H}^t(F) = 0$ for $t > s$. Thus a graph of $\mathcal{H}^s(F)$ against $s$ (figure 2.3) shows
that there is a critical value of $s$ at which $\mathcal{H}^s(F)$ 'jumps' from $\infty$ to $0$. This
critical value is called the Hausdorff dimension of $F$, and written $\dim_H F$; it is
defined for any set $F \subset \mathbb{R}^n$. (Note that some authors refer to Hausdorff dimension
as Hausdorff-Besicovitch dimension.) Formally

$$\dim_H F = \inf\{s \ge 0 : \mathcal{H}^s(F) = 0\} = \sup\{s : \mathcal{H}^s(F) = \infty\} \tag{2.11}$$

(taking the supremum of the empty set to be 0), so that

$$\mathcal{H}^s(F) = \begin{cases} \infty & \text{if } 0 \le s < \dim_H F \\ 0 & \text{if } s > \dim_H F. \end{cases}$$

If $s = \dim_H F$, then $\mathcal{H}^s(F)$ may be zero or infinite, or may satisfy

$$0 < \mathcal{H}^s(F) < \infty.$$



Figure 2.3 Graph of $\mathcal{H}^s(F)$ against $s$ for a set $F$. The Hausdorff dimension is the value
of $s$ at which the 'jump' from $\infty$ to $0$ occurs
A Borel set satisfying this last condition is called an s-set. Mathematically,
s-sets are by far the most convenient sets to study, and fortunately they occur 
surprisingly often. 
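
The 'jump' behaviour can be glimpsed numerically using the middle third Cantor set (figure 0.1) as a test case. The Python sketch below sums $|U_i|^s$ over the natural cover by the $2^k$ intervals of length $3^{-k}$ arising at stage $k$ of the construction. Two caveats apply: these sums are only upper bounds for $\mathcal{H}^s_\delta$ with $\delta = 3^{-k}$, since the infimum in (2.1) ranges over all $\delta$-covers, and the value $\log 2 / \log 3$ used as the critical exponent is the well-known dimension of this set, which is not derived in this section. The function names are illustrative.

```python
from math import log

def cantor_intervals(k):
    """The 2^k closed intervals of length 3^-k at stage k of the construction."""
    intervals = [(0.0, 1.0)]
    for _ in range(k):
        next_stage = []
        for a, b in intervals:
            third = (b - a) / 3.0
            next_stage.append((a, a + third))      # keep the left third
            next_stage.append((b - third, b))      # keep the right third
        intervals = next_stage
    return intervals

def cover_sum(k, s):
    """Sum of |U_i|^s over the stage-k intervals, an upper bound for H^s_delta."""
    return sum((b - a) ** s for a, b in cantor_intervals(k))

# Exponent at which the natural cover sums stay equal to 1 for every k.
S_CRIT = log(2) / log(3)
```

Each sum equals $(2 / 3^s)^k$, so as $k$ increases it grows without bound for $s$ below `S_CRIT`, stays at 1 for $s$ equal to `S_CRIT`, and tends to 0 for $s$ above it.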

For a very simple example, let $F$ be a flat disc of unit radius in $\mathbb{R}^3$. From
familiar properties of length, area and volume, $\mathcal{H}^1(F) = \text{length}(F) = \infty$,
$0 < \mathcal{H}^2(F) = (4/\pi) \times \operatorname{area}(F) = 4 < \infty$ and $\mathcal{H}^3(F) = (6/\pi) \times \operatorname{vol}(F) = 0$.
Thus $\dim_H F = 2$, with $\mathcal{H}^s(F) = \infty$ if $s < 2$ and $\mathcal{H}^s(F) = 0$ if $s > 2$.

Hausdorff dimension satisfies the following properties (which might well be 
expected to hold for any reasonable definition of dimension). 

Monotonicity. If $E \subset F$ then $\dim_H E \leq \dim_H F$. This is immediate from the
measure property that $\mathcal{H}^s(E) \leq \mathcal{H}^s(F)$ for each $s$.

Countable stability. If $F_1, F_2, \ldots$ is a (countable) sequence of sets then
$\dim_H \bigcup_{i=1}^\infty F_i = \sup_{1 \leq i < \infty} \{\dim_H F_i\}$. Certainly,
$\dim_H \bigcup_{i=1}^\infty F_i \geq \dim_H F_j$ for each $j$ from the monotonicity property.
On the other hand, if $s > \dim_H F_i$ for all $i$, then $\mathcal{H}^s(F_i) = 0$, so that
$\mathcal{H}^s(\bigcup_{i=1}^\infty F_i) = 0$, giving the opposite inequality.

Countable sets. If $F$ is countable then $\dim_H F = 0$. For if $F_i$ is a single point,
$\mathcal{H}^0(F_i) = 1$ and $\dim_H F_i = 0$, so by countable stability
$\dim_H \bigcup_{i=1}^\infty F_i = 0$.

Open sets. If $F \subset \mathbb{R}^n$ is open, then $\dim_H F = n$. For since $F$ contains a
ball of positive $n$-dimensional volume, $\dim_H F \geq n$, but since $F$ is contained in
countably many balls, $\dim_H F \leq n$ using countable stability and monotonicity.

Smooth sets. If $F$ is a smooth (i.e. continuously differentiable) $m$-dimensional
submanifold (i.e. $m$-dimensional surface) of $\mathbb{R}^n$ then $\dim_H F = m$. In particular
smooth curves have dimension 1 and smooth surfaces have dimension 2.
Essentially, this may be deduced from the relationship between Hausdorff and
Lebesgue measures, see also Exercise 2.7.

The transformation properties of Hausdorff dimension follow immediately 
from the corresponding ones for Hausdorff measures given in Proposition 2.2. 



Proposition 2.3 

Let $F \subset \mathbb{R}^n$ and suppose that $f : F \to \mathbb{R}^m$ satisfies a Hölder condition

$$|f(x) - f(y)| \leq c|x - y|^\alpha \quad (x, y \in F).$$

Then $\dim_H f(F) \leq (1/\alpha) \dim_H F$.

Proof. If $s > \dim_H F$ then by Proposition 2.2
$\mathcal{H}^{s/\alpha}(f(F)) \leq c^{s/\alpha} \mathcal{H}^s(F) = 0$,
implying that $\dim_H f(F) \leq s/\alpha$ for all $s > \dim_H F$. □



Corollary 2.4 

(a) If $f : F \to \mathbb{R}^m$ is a Lipschitz transformation (see (2.8)) then
$\dim_H f(F) \leq \dim_H F$.

(b) If $f : F \to \mathbb{R}^m$ is a bi-Lipschitz transformation, i.e.

$$c_1|x - y| \leq |f(x) - f(y)| \leq c_2|x - y| \quad (x, y \in F) \tag{2.13}$$

where $0 < c_1 \leq c_2 < \infty$, then $\dim_H f(F) = \dim_H F$.

Proof. Part (a) follows from Proposition 2.3 taking $\alpha = 1$. Applying this to
$f^{-1} : f(F) \to F$ gives the other inequality required for (b). □

This corollary reveals a fundamental property of Hausdorff dimension: Haus- 
dorff dimension is invariant under bi-Lipschitz transformations. Thus if two sets 
have different dimensions there cannot be a bi-Lipschitz mapping from one onto 
the other. This is reminiscent of the situation in topology where various 'invari- 
ants' (such as homotopy or homology groups) are set up to distinguish between 
sets that are not homeomorphic: if the topological invariants of two sets differ 
then there cannot be a homeomorphism (continuous one-to-one mapping with 
continuous inverse) between the two sets. 

In topology two sets are regarded as 'the same' if there is a homeomor- 
phism between them. One approach to fractal geometry is to regard two sets 
as 'the same' if there is a bi-Lipschitz mapping between them. Just as topo- 
logical invariants are used to distinguish between non-homeomorphic sets, we 
may seek parameters, including dimension, to distinguish between sets that are 
not bi-Lipschitz equivalent. Since bi-Lipschitz transformations (2.13) are neces- 
sarily homeomorphisms, topological parameters provide a start in this direction, 
and Hausdorff dimension (and other definitions of dimension) provide further 
distinguishing characteristics between fractals. 

In general, the dimension of a set alone tells us little about its topological 
properties. However, any set of dimension less than 1 is necessarily so sparse 
as to be totally disconnected; that is, no two of its points lie in the same con- 
nected component. 

Proposition 2.5 

A set $F \subset \mathbb{R}^n$ with $\dim_H F < 1$ is totally disconnected.

Proof. Let $x$ and $y$ be distinct points of $F$. Define a mapping $f : \mathbb{R}^n \to [0, \infty)$
by $f(z) = |z - x|$. Since $f$ does not increase distances, as
$|f(z) - f(w)| = \big||z - x| - |w - x|\big| \leq |(z - x) - (w - x)| = |z - w|$,
we have from Corollary 2.4(a) that $\dim_H f(F) \leq \dim_H F < 1$. Thus $f(F)$ is a subset
of $\mathbb{R}$ of $\mathcal{H}^1$-measure or length zero, and so has a dense complement. Choosing
$r$ with $r \notin f(F)$ and $0 < r < f(y)$ it follows that

$$F = \{z \in F : |z - x| < r\} \cup \{z \in F : |z - x| > r\}.$$

Thus $F$ is contained in two disjoint open sets with $x$ in one set and $y$ in the
other, so that $x$ and $y$ lie in different connected components of $F$. □

2.3 Calculation of Hausdorff dimension: simple examples

This section indicates how to calculate the Hausdorff dimension of some simple 
fractals such as some of those mentioned in the Introduction. Other methods will 
be encountered throughout the book. It is important to note that most dimension 
calculations involve an upper estimate and a lower estimate, which are hopefully 
equal. Each of these estimates usually involves a geometric observation followed 
by a calculation. 

Example 2.6 

Let $F$ be the Cantor dust constructed from the unit square as in figure 0.4. (At
each stage of the construction the squares are divided into 16 squares with a
quarter of the side length, of which the same pattern of four squares is retained.)
Then $1 \leq \mathcal{H}^1(F) \leq \sqrt{2}$, so $\dim_H F = 1$.

Calculation. Observe that $E_k$, the $k$th stage of the construction, consists of $4^k$
squares of side $4^{-k}$ and thus of diameter $4^{-k}\sqrt{2}$. Taking the squares of $E_k$ as a
$\delta$-cover of $F$ where $\delta = 4^{-k}\sqrt{2}$, we get an estimate
$\mathcal{H}^1_\delta(F) \leq 4^k 4^{-k}\sqrt{2}$ for the infimum in (2.1). As $k \to \infty$ so
$\delta \to 0$ giving $\mathcal{H}^1(F) \leq \sqrt{2}$.

For the lower estimate, let proj denote orthogonal projection onto the $x$-axis.
Orthogonal projection does not increase distances, i.e.
$|\text{proj}\, x - \text{proj}\, y| \leq |x - y|$ if $x, y \in \mathbb{R}^2$, so proj is a Lipschitz
mapping. By virtue of the construction of $F$, the projection or 'shadow' of $F$ on the
$x$-axis, $\text{proj}\, F$, is the unit interval $[0, 1]$. Using (2.9)

$$1 = \text{length}\,[0, 1] = \mathcal{H}^1([0, 1]) = \mathcal{H}^1(\text{proj}\, F) \leq \mathcal{H}^1(F).$$ □

Note that the same argument and result hold for a set obtained by repeated
division of squares into $m^2$ squares of side length $1/m$ of which one square in
each column is retained.
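The two observations behind Example 2.6 can be checked numerically. The Python sketch below is illustrative only: the pattern of retained squares `PATTERN` is an assumed one with one square in each column (figure 0.4 fixes the actual pattern), and the function names are hypothetical. It confirms that the squares of $E_k$ give the cover sum $4^k 4^{-k}\sqrt{2} = \sqrt{2}$ and that every column of the stage-$k$ grid is occupied, so that the projection of $E_k$ onto the $x$-axis is $[0, 1]$.

```python
from math import sqrt

# Assumed pattern of retained squares, as (column, row) in the 4 x 4 grid,
# with exactly one square in each column. Figure 0.4 fixes the real pattern.
PATTERN = [(0, 1), (1, 3), (2, 0), (3, 2)]


def stage_squares(k):
    """Integer (column, row) indices of the 4**k squares of E_k on a grid of side 4**-k."""
    squares = [(0, 0)]
    for _ in range(k):
        squares = [(4 * cx + px, 4 * cy + py)
                   for (cx, cy) in squares for (px, py) in PATTERN]
    return squares


def cover_sum(k, s=1.0):
    """Sum of |U_i|**s over the squares of E_k, each of diameter 4**-k * sqrt(2)."""
    diameter = 4.0 ** (-k) * sqrt(2.0)
    return len(stage_squares(k)) * diameter ** s


def projection_is_full(k):
    """True if every one of the 4**k columns holds a square, so proj E_k = [0, 1]."""
    columns = {cx for (cx, _) in stage_squares(k)}
    return columns == set(range(4 ** k))


for k in range(1, 5):
    print(k, len(stage_squares(k)), cover_sum(k), projection_is_full(k))
```

The check of the projection covers only the finite stages $E_k$; passing to the limit set $F$ still relies on the construction, as in the text.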

This trick of using orthogonal projection to get a lower estimate of Hausdorff 
measure only works in special circumstances and is not the basis of a more 
general method. Usually we need to work rather harder! 

Example 2.7 

Let $F$ be the middle third Cantor set (see figure 0.1). If $s = \log 2/\log 3 =
0.6309\ldots$ then $\dim_H F = s$ and $\tfrac{1}{2} \leq \mathcal{H}^s(F) \leq 1$.

Heuristic calculation. The Cantor set $F$ splits into a left part $F_L = F \cap [0, \tfrac{1}{3}]$
and a right part $F_R = F \cap [\tfrac{2}{3}, 1]$. Clearly both parts are geometrically similar
to $F$ but scaled by a ratio $\tfrac{1}{3}$, and $F = F_L \cup F_R$ with this union disjoint. Thus
for any $s$

$$\mathcal{H}^s(F) = \mathcal{H}^s(F_L) + \mathcal{H}^s(F_R) = (\tfrac{1}{3})^s \mathcal{H}^s(F) + (\tfrac{1}{3})^s \mathcal{H}^s(F)$$

by the scaling property 2.1 of Hausdorff measures. Assuming that at the critical
value $s = \dim_H F$ we have $0 < \mathcal{H}^s(F) < \infty$ (a big assumption, but one that can
be justified) we may divide by $\mathcal{H}^s(F)$ to get $1 = 2(\tfrac{1}{3})^s$ or $s = \log 2/\log 3$.

Rigorous calculation. We call the intervals that make up the sets $E_k$ in the
construction of $F$ level-$k$ intervals. Thus $E_k$ consists of $2^k$ level-$k$ intervals each of
length $3^{-k}$.

Taking the intervals of $E_k$ as a $3^{-k}$-cover of $F$ gives that
$\mathcal{H}^s_{3^{-k}}(F) \leq 2^k 3^{-ks} = 1$ if $s = \log 2/\log 3$. Letting $k \to \infty$ gives
$\mathcal{H}^s(F) \leq 1$.

To prove that $\mathcal{H}^s(F) \geq \tfrac{1}{2}$ we show that

$$\sum_i |U_i|^s \geq \tfrac{1}{2} = 3^{-s} \tag{2.14}$$

for any cover $\{U_i\}$ of $F$. Clearly, it is enough to assume that the $\{U_i\}$ are intervals,
and by expanding them slightly and using the compactness of $F$, we need only
verify (2.14) if $\{U_i\}$ is a finite collection of closed subintervals of $[0, 1]$. For each
$U_i$, let $k$ be the integer such that

$$3^{-(k+1)} \leq |U_i| < 3^{-k}. \tag{2.15}$$

Then $U_i$ can intersect at most one level-$k$ interval since the separation of these
level-$k$ intervals is at least $3^{-k}$. If $j \geq k$ then, by construction, $U_i$ intersects
at most $2^{j-k} = 2^j 3^{-sk} \leq 2^j 3^s |U_i|^s$ level-$j$ intervals of $E_j$, using (2.15). If we
choose $j$ large enough so that $3^{-(j+1)} \leq |U_i|$ for all $U_i$, then, since the $\{U_i\}$
intersect all $2^j$ basic intervals of length $3^{-j}$, counting intervals gives
$2^j \leq \sum_i 2^j 3^s |U_i|^s$, which reduces to (2.14). □

With extra effort, the calculation can be adapted to show that $\mathcal{H}^s(F) = 1$.

It is already becoming apparent that calculation of Hausdorff measures and 
dimensions can be a little involved, even for simple sets. Usually it is the lower 
estimate that is awkward to obtain. 
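The upper estimate in Example 2.7 is easy to tabulate. The following Python sketch (the function name `level_k_sum` is hypothetical) evaluates the sum $\sum |U_i|^s = 2^k 3^{-ks}$ over the level-$k$ intervals for exponents below, at and above $\log 2/\log 3$. It illustrates how these particular cover sums grow, stay at 1, or decay as $k$ increases; it only bounds $\mathcal{H}^s_\delta(F)$ from above and says nothing about the lower estimate.

```python
from math import log


def level_k_sum(k, s):
    """Sum of |U_i|**s over the 2**k level-k intervals, each of length 3**-k."""
    return (2.0 * 3.0 ** (-s)) ** k


s_crit = log(2) / log(3)
for s in (0.5, s_crit, 0.8):
    sums = [level_k_sum(k, s) for k in (1, 5, 20, 40)]
    print(f"s = {s:.4f}:", ["%.4g" % value for value in sums])
```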

The 'heuristic' method of calculation used in Example 2.7 gives the right
answer for the dimension of many self-similar sets. For example, the von Koch
curve is made up of four copies of itself scaled by a factor $\tfrac{1}{3}$, and hence has
dimension $\log 4/\log 3$. More generally, if $F = \bigcup_{i=1}^m F_i$, where each $F_i$ is
geometrically similar to $F$ but scaled by a factor $c_i$, then, provided that the $F_i$
do not overlap 'too much', the heuristic argument gives $\dim_H F$ as the number
$s$ satisfying $\sum_{i=1}^m c_i^s = 1$. The validity of this formula is discussed fully in
Chapter 9.
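When the scaling factors are not all equal, the equation $\sum_{i=1}^m c_i^s = 1$ usually has to be solved numerically. A minimal sketch in Python, assuming every factor lies strictly between 0 and 1 (the function name `similarity_dimension` and the choice of bisection are illustrative, not taken from the text):

```python
from math import log


def similarity_dimension(ratios, tol=1e-12):
    """Solve sum(c**s for c in ratios) == 1 for s by bisection (each 0 < c < 1)."""
    if not ratios or any(not (0.0 < c < 1.0) for c in ratios):
        raise ValueError("ratios must be a non-empty list of numbers in (0, 1)")

    def g(s):
        # Strictly decreasing in s, with g(0) = len(ratios) - 1 >= 0.
        return sum(c ** s for c in ratios) - 1.0

    lo, hi = 0.0, 1.0
    while g(hi) > 0.0:  # enlarge the bracket until it contains the root
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


print(similarity_dimension([1 / 3, 1 / 3]), log(2) / log(3))  # middle third Cantor set
print(similarity_dimension([1 / 3] * 4), log(4) / log(3))     # von Koch curve
```

The value returned is only the candidate given by the heuristic argument; whether it equals the Hausdorff dimension depends on the non-overlap condition mentioned above.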



*2.4 Equivalent definitions of Hausdorff dimension 

It is worth pointing out that there are other classes of covering set that define
measures leading to Hausdorff dimension. For example, we could use coverings
by spherical balls: letting

$$\mathcal{B}^s_\delta(F) = \inf\Big\{\sum_i |B_i|^s : \{B_i\} \text{ is a } \delta\text{-cover of } F \text{ by balls}\Big\} \tag{2.16}$$

we obtain a measure $\mathcal{B}^s(F) = \lim_{\delta \to 0} \mathcal{B}^s_\delta(F)$ and a 'dimension' at which
$\mathcal{B}^s(F)$ jumps from $\infty$ to 0. Clearly $\mathcal{H}^s_\delta(F) \leq \mathcal{B}^s_\delta(F)$ since any
$\delta$-cover of $F$ by balls is a permissible covering in the definition of $\mathcal{H}^s_\delta$.
Also, if $\{U_i\}$ is a $\delta$-cover of $F$, then $\{B_i\}$ is a $2\delta$-cover, where, for each $i$,
$B_i$ is chosen to be some ball containing $U_i$ and of radius $|U_i| \leq \delta$. Thus
$\sum_i |B_i|^s \leq \sum_i (2|U_i|)^s = 2^s \sum_i |U_i|^s$, and taking infima gives
$\mathcal{B}^s_{2\delta}(F) \leq 2^s \mathcal{H}^s_\delta(F)$. Letting $\delta \to 0$ it follows that
$\mathcal{H}^s(F) \leq \mathcal{B}^s(F) \leq 2^s \mathcal{H}^s(F)$. In particular, this implies that the
values of $s$ at which $\mathcal{H}^s$ and $\mathcal{B}^s$ jump from $\infty$ to 0 are the same, so that
the dimensions defined by the two measures are equal.

It is easy to check that we get the same values for Hausdorff measure and
dimension if in (2.1) we use $\delta$-covers of just open sets or just closed sets.
Moreover, if $F$ is compact, then, by expanding the covering sets slightly to open
sets, and taking a finite subcover, we get the same value of $\mathcal{H}^s(F)$ if we merely
consider $\delta$-covers by finite collections of sets.

Net measures are another useful variant. For the sake of simplicity let F 
be a subset of the interval [0, 1). A binary interval is an interval of the form 
$[r2^{-k}, (r + 1)2^{-k})$ where $k = 0, 1, 2, \ldots$ and $r = 0, 1, \ldots, 2^k - 1$. We define

$$\mathcal{M}^s_\delta(F) = \inf\Big\{\sum_i |U_i|^s : \{U_i\} \text{ is a } \delta\text{-cover of } F \text{ by binary intervals}\Big\} \tag{2.17}$$

leading to the net measures

$$\mathcal{M}^s(F) = \lim_{\delta \to 0} \mathcal{M}^s_\delta(F). \tag{2.18}$$

Since any interval $U \subset [0, 1)$ is contained in two consecutive binary intervals
each of length at most $2|U|$ we see, in just the same way as for the measure
$\mathcal{B}^s$, that

$$\mathcal{H}^s(F) \leq \mathcal{M}^s(F) \leq 2^{s+1} \mathcal{H}^s(F). \tag{2.19}$$

It follows that the value of $s$ at which $\mathcal{M}^s(F)$ jumps from $\infty$ to 0 equals the
Hausdorff dimension of $F$, i.e. both definitions of measure give the same dimension.
For certain purposes net measures are much more convenient than Hausdorff 
measures. This is because two binary intervals are either disjoint or one of them 
is contained in the other, allowing any cover of binary intervals to be reduced to 
a cover of disjoint binary intervals. 
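The covering step behind (2.19) can be made explicit. The Python sketch below (the function name `binary_cover` and the representation of intervals as `(r, k)` pairs are assumptions for illustration) takes a half-open interval $[a, b) \subset [0, 1)$, chooses $k$ with $b - a \leq 2^{-k} < 2(b - a)$, and returns the one or two consecutive binary intervals of length $2^{-k}$ that contain it. Exact rational arithmetic is used so that end points on the binary grid are handled correctly.

```python
from fractions import Fraction
from math import ceil, floor


def binary_cover(a, b):
    """Binary intervals [r * 2**-k, (r + 1) * 2**-k) covering [a, b), as (r, k) pairs.

    Assumes 0 <= a < b <= 1. At most two consecutive intervals are returned,
    each of length 2**-k with (b - a) <= 2**-k < 2 * (b - a).
    """
    a, b = Fraction(a), Fraction(b)
    if not (0 <= a < b <= 1):
        raise ValueError("need 0 <= a < b <= 1")
    length = b - a
    k = 0
    while Fraction(1, 2 ** (k + 1)) >= length:  # halve while [a, b) could still fit
        k += 1
    size = Fraction(1, 2 ** k)
    first = floor(a / size)     # binary interval containing the left end point a
    last = ceil(b / size) - 1   # binary interval containing points just below b
    return [(r, k) for r in range(first, last + 1)]


print(binary_cover(Fraction(3, 10), Fraction(2, 5)))
```

For $[3/10, 2/5)$ the sketch selects $k = 3$ and the intervals $[2/8, 3/8)$ and $[3/8, 4/8)$, each of length $1/8 \leq 2 \times 1/10$.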

*2.5 Finer definitions of dimension 

It is sometimes desirable to have a sharper indication of dimension than just a 
number. To achieve this let $h : \mathbb{R}^+ \to \mathbb{R}^+$ be a function that is increasing and
continuous, which we call a dimension function or gauge function. Analogously
to (2.1) we define

$$\mathcal{H}^h_\delta(F) = \inf\Big\{\sum_i h(|U_i|) : \{U_i\} \text{ is a } \delta\text{-cover of } F\Big\} \tag{2.20}$$

for $F$ a subset of $\mathbb{R}^n$. This leads to a measure, taking
$\mathcal{H}^h(F) = \lim_{\delta \to 0} \mathcal{H}^h_\delta(F)$.
(If $h(t) = t^s$ this is the usual definition of $s$-dimensional Hausdorff measure.) If
$h$ and $g$ are dimension functions such that $h(t)/g(t) \to 0$ as $t \to 0$ then, by an
argument similar to (2.10), we get that $\mathcal{H}^h(F) = 0$ whenever $\mathcal{H}^g(F) < \infty$. Thus
partitioning the dimension functions into those for which $\mathcal{H}^h$ is finite and those
for which it is infinite gives a more precise indication of the 'dimension' of $F$
than just the number $\dim_H F$.

An important example of this is Brownian motion in $\mathbb{R}^3$ (see Chapter 16
for further details). It may be shown that (with probability 1) a Brownian path
has Hausdorff dimension 2 but with $\mathcal{H}^2$-measure equal to 0. More refined
calculations show that such a path has positive and finite $\mathcal{H}^h$-measure, where
$h(t) = t^2 \log\log(1/t)$. Although Brownian paths have dimension 2, the dimension
is, in a sense, logarithmically smaller than 2.



2.6 Notes and references 

The idea of defining measures using covers of sets was introduced by 
Carathéodory (1914). Hausdorff (1919) used this method to define the measures
that now bear his name, and showed that the middle third Cantor set has positive 
and finite measure of dimension log 2/ log 3. Properties of Hausdorff measures 
have been developed ever since, not least by Besicovitch and his students. 

Technical aspects of Hausdorff measures and dimensions are discussed in rather 
more detail in Falconer (1985a), and in greater generality in the books of Rogers 
(1998), Federer (1996) and Mattila (1995). Merzenich and Staiger (1994) relate 
Hausdorff dimension to formal languages and automata theory. 



Exercises 

2.1 Verify that the value of $\mathcal{H}^s(F)$ is unaltered if, in (2.1), we only consider
$\delta$-covers by sets $\{U_i\}$ that are all closed.

2.2 Show that $\mathcal{H}^0(F)$ equals the number of points in the set $F$.

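2.3 Verify from the definition that $\mathcal{H}^s(\emptyset) = 0$, that $\mathcal{H}^s(E) \leq \mathcal{H}^s(F)$ if
$E \subset F$, and that $\mathcal{H}^s(\bigcup_{i=1}^\infty F_i) \leq \sum_{i=1}^\infty \mathcal{H}^s(F_i)$.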

2.4 Let $F$ be the closed interval $[0, 1]$. Show that $\mathcal{H}^s(F) = \infty$ if $0 \leq s < 1$, that
$\mathcal{H}^s(F) = 0$ if $s > 1$, and that $0 < \mathcal{H}^1(F) < \infty$.

2.5 Let $f : \mathbb{R} \to \mathbb{R}$ be a differentiable function with continuous derivative. Show that
$\dim_H f(F) \leq \dim_H F$ for any set $F$. (Consider the case of $F$ bounded first and show
that $f$ is Lipschitz on $F$.)

2.6 Let $f : \mathbb{R} \to \mathbb{R}$ be the function $f(x) = x^2$, and let $F$ be any subset of $\mathbb{R}$. Show that
$\dim_H f(F) = \dim_H F$.

2.7 Let $f : [0, 1] \to \mathbb{R}$ be a Lipschitz function. Writing
$\text{graph}\, f = \{(x, f(x)) : 0 \leq x \leq 1\}$, show that $\dim_H \text{graph}\, f = 1$. Note, in
particular, that this is true if $f$ is continuously differentiable, see Exercise 1.13.

2.8 What is the Hausdorff dimension of the sets $\{0, 1, 2, 3, \ldots\}$ and
$\{0, 1, \tfrac{1}{2}, \tfrac{1}{3}, \tfrac{1}{4}, \ldots\}$ in $\mathbb{R}$?

2.9 Let $F$ be the set consisting of the numbers between 0 and 1 whose decimal
expansions do not contain the digit 5. Use a 'heuristic' argument to show that
$\dim_H F = \log 9/\log 10$. Can you prove this by a rigorous argument? Generalize this result.

2.10 Let $F$ consist of the points $(x, y) \in \mathbb{R}^2$ such that the decimal expansions of
neither $x$ nor $y$ contain the digit 5. Use a 'heuristic' argument to show that
$\dim_H F = 2 \log 9/\log 10$.

2.11 Use a 'heuristic' argument to show that the Hausdorff dimension of the set depicted
in figure 0.5 is given by the solution of the equation $4(\tfrac{1}{4})^s + (\tfrac{1}{2})^s = 1$. By solving
a quadratic equation in $(\tfrac{1}{2})^s$, find an explicit expression for $s$.

2.12 Let $F$ be the set of real numbers with base-3 expansion
$b_m b_{m-1} \cdots b_1 . a_1 a_2 \cdots$ with none of the digits $b_i$ or $a_i$ equal to 1. (Thus $F$
is constructed by a Cantor-like process extending outwards as well as inwards.) What is
the Hausdorff dimension of $F$?

2.13 What is the Hausdorff dimension of the set of numbers $x$ with base-3 expansion
$0.a_1 a_2 \cdots$ for which there is a positive integer $k$ (which may depend on $x$) such
that $a_i \neq 1$ for all $i \geq k$?

2.14 Let $F$ be the middle-$\lambda$ Cantor set (obtained by removing a proportion $0 < \lambda < 1$
from the middle of intervals). Use a 'heuristic argument' to show that
$\dim_H F = \log 2/\log(2/(1 - \lambda))$. Now let $E = F \times F \subset \mathbb{R}^2$. Show in the same way
that $\dim_H E = 2 \log 2/\log(2/(1 - \lambda))$.

2.15 Show that there is a totally disconnected subset of the plane of Hausdorff dimension
$s$ for every $0 \leq s \leq 2$. (Modify the construction of the Cantor dust in figure 0.4.)

2.16 Let $S$ be the unit circle in the plane, with points on $S$ parameterized by the angle
$\theta$ subtended at the centre with a fixed axis, so that $\theta_1$ and $\theta_2$ represent the
same point if and only if $\theta_1$ and $\theta_2$ differ by a multiple of $2\pi$, in the usual way.
Let $F = \{\theta \in S : 0 \leq 3^k\theta \leq \pi \pmod{2\pi} \text{ for all } k = 1, 2, \ldots\}$. Show that
$\dim_H F = \log 2/\log 3$.

2.17 Show that if $h$ and $g$ are dimension functions such that $h(t)/g(t) \to 0$ as $t \to 0$
then $\mathcal{H}^h(F) = 0$ whenever $\mathcal{H}^g(F) < \infty$.



Chapter 3 Alternative definitions 
of dimension 



Hausdorff dimension, discussed in the last chapter, is the principal definition of 
dimension that we shall work with. However, other definitions are in widespread 
use, and it is appropriate to examine some of these and their inter-relationship. Not 
all definitions are generally applicable — some only describe particular classes of 
set, such as curves. 

Fundamental to most definitions of dimension is the idea of 'measurement at
scale $\delta$'. For each $\delta$, we measure a set in a way that ignores irregularities of size
less than $\delta$, and we see how these measurements behave as $\delta \to 0$. For example,
if $F$ is a plane curve, then our measurement, $M_\delta(F)$, might be the number of
steps required by a pair of dividers set at length $\delta$ to traverse $F$. A dimension of
$F$ is then determined by the power law (if any) obeyed by $M_\delta(F)$ as $\delta \to 0$. If

$$M_\delta(F) \sim c\delta^{-s} \tag{3.1}$$

for constants $c$ and $s$, we might say that $F$ has 'divider dimension' $s$, with $c$
regarded as the '$s$-dimensional length' of $F$. Taking logarithms

$$\log M_\delta(F) \simeq \log c - s \log \delta \tag{3.2}$$

in the sense that the difference of the two sides tends to 0 with $\delta$, and

$$s = \lim_{\delta \to 0} \frac{\log M_\delta(F)}{-\log \delta}. \tag{3.3}$$

These formulae are appealing for computational or experimental purposes, since $s$
can be estimated as minus the gradient of a log-log graph plotted over a suitable
range of $\delta$; see figure 3.1. Of course, for real phenomena, we can only work
with a finite range of $\delta$; theory and experiment diverge before an atomic scale is
reached. For example, if $F$ is the coastline of Britain, plotting a log-log graph
for $\delta$ between 20 m and 200 km gives the divider dimension of $F$ about 1.2.

Figure 3.1 Empirical estimation of a dimension of a set $F$, on the power-law assumption
$M_\delta(F) \sim c\delta^{-s}$ (axes: $\log M_\delta(F)$ against $\log \delta$)
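The log-log estimate described above amounts to a straight-line fit. A minimal sketch in Python, assuming measurements $M_\delta(F)$ are available at several scales $\delta$; the function name `fit_power_law` is hypothetical, and the data in the usage lines are synthetic values generated from an exact power law, not measurements of any real set:

```python
from math import exp, log


def fit_power_law(deltas, measurements):
    """Least-squares fit of log M = log c - s log delta; returns (s, c)."""
    xs = [log(d) for d in deltas]
    ys = [log(m) for m in measurements]
    n = len(xs)
    if n < 2:
        raise ValueError("need measurements at two or more scales")
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # gradient of the log-log graph
    intercept = y_mean - slope * x_mean    # estimate of log c
    return -slope, exp(intercept)


# Synthetic data obeying M = 3 * delta**-1.2 exactly (illustration only).
deltas = [2.0 ** (-j) for j in range(1, 9)]
measurements = [3.0 * d ** (-1.2) for d in deltas]
print(fit_power_law(deltas, measurements))
```

With real data the fitted value depends on the range of $\delta$ chosen, which is one reason the estimate should be read as empirical rather than as a limit.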

There may be no exact power law for $M_\delta(F)$, and the closest we can get to
(3.3) are the lower and upper limits.

For the value of $s$ given by (3.1) to behave like a dimension, the method of
measurement needs to scale with the set, so that doubling the size of $F$ and at
the same time doubling the scale at which measurement takes place does not
affect the answer; that is, we require $M_\delta(\delta F) = M_1(F)$ for all $\delta$. If we modify
our example and redefine $M_\delta(F)$ to be the sum of the divider step lengths then
$M_\delta(F)$ is homogeneous of degree 1, i.e. $M_\delta(\delta F) = \delta^1 M_1(F)$ for $\delta > 0$, and this
must be taken into account when defining the dimension. In general, if $M_\delta(F)$ is
homogeneous of degree $d$, that is $M_\delta(\delta F) = \delta^d M_1(F)$, then a power law of the
form $M_\delta(F) \sim c\delta^{d-s}$ corresponds to a dimension $s$.

There are no hard and fast rules for deciding whether a quantity may reasonably 
be regarded as a dimension. There are many definitions that do not fit exactly into 
the above, rather simplified, scenario. The factors that determine the acceptability 
of a definition of a dimension are recognized largely by experience and intuition. 
In general one looks for some sort of scaling behaviour, a naturalness of the 
definition in the particular context and properties typical of dimensions such as 
those discussed below. 

A word of warning: as we shall see, apparently similar definitions of dimen- 
sion can have widely differing properties. It should not be assumed that different 
definitions give the same value of dimension, even for 'nice' sets. Such assump- 
tions have led to major misconceptions and confusion in the past. It is necessary 
to derive the properties of any 'dimension' from its definition. The properties of 
Hausdorff dimension (on which we shall largely concentrate in the later chapters 
of this book) do not necessarily all hold for other definitions.

What are the desirable properties of a 'dimension'? Those derived in the last
chapter for Hausdorff dimension are fairly typical.

Monotonicity. If $E \subset F$ then $\dim_H E \leq \dim_H F$.

Stability. $\dim_H(E \cup F) = \max(\dim_H E, \dim_H F)$.

Countable stability. $\dim_H(\bigcup_{i=1}^\infty F_i) = \sup_{1 \leq i < \infty} \dim_H F_i$.

Geometric invariance. $\dim_H f(F) = \dim_H F$ if $f$ is a transformation of $\mathbb{R}^n$
such as a translation, rotation, similarity or affinity.

Lipschitz invariance. $\dim_H f(F) = \dim_H F$ if $f$ is a bi-Lipschitz transformation.

Countable sets. $\dim_H F = 0$ if $F$ is finite or countable.

Open sets. If $F$ is an open subset of $\mathbb{R}^n$ then $\dim_H F = n$.

Smooth manifolds. $\dim_H F = m$ if $F$ is a smooth $m$-dimensional manifold
(curve, surface, etc.).

All definitions of dimension are monotonic, most are stable, but, as we shall 
see, some common definitions fail to exhibit countable stability and may have 
countable sets of positive dimension. All the usual dimensions are Lipschitz 
invariant, and, therefore, geometrically invariant. The 'open sets' and 'smooth 
manifolds' properties ensure that the dimension is an extension of the classi- 
cal definition. Note that different definitions of dimension can provide different 
information about which sets are Lipschitz equivalent. 



3.1 Box-counting dimensions 

Box-counting or box dimension is one of the most widely used dimensions. 
Its popularity is largely due to its relative ease of mathematical calculation and 
empirical estimation. The definition goes back at least to the 1930s and it has been 
variously termed Kolmogorov entropy, entropy dimension, capacity dimension (a 
term best avoided in view of potential theoretic associations), metric dimension, 
logarithmic density and information dimension. We shall always refer to box or 
box-counting dimension to avoid confusion. 

Let $F$ be any non-empty bounded subset of $\mathbb{R}^n$ and let $N_\delta(F)$ be the smallest
number of sets of diameter at most $\delta$ which can cover $F$. The lower and upper
box-counting dimensions of $F$ respectively are defined as

$$\underline{\dim}_B F = \underline{\lim}_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta} \tag{3.4}$$

$$\overline{\dim}_B F = \overline{\lim}_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta}. \tag{3.5}$$

If these are equal we refer to the common value as the box-counting dimension
or box dimension of $F$

$$\dim_B F = \lim_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta}. \tag{3.6}$$

Here, and throughout the book, we assume that $\delta > 0$ is sufficiently small to
ensure that $-\log \delta$ and similar quantities are strictly positive. To avoid problems
with 'log 0' or 'log $\infty$' we generally consider box dimension only for non-empty
bounded sets. In developing the general theory of box dimensions we assume
that sets considered are non-empty and bounded.

There are several equivalent definitions of box dimension that are sometimes 
more convenient to use. Consider the collection of cubes in the $\delta$-coordinate
mesh of $\mathbb{R}^n$, i.e. cubes of the form

$$[m_1\delta, (m_1 + 1)\delta] \times \cdots \times [m_n\delta, (m_n + 1)\delta]$$

where $m_1, \ldots, m_n$ are integers. (Recall that a 'cube' is an interval in $\mathbb{R}^1$ and a
square in $\mathbb{R}^2$.) Let $N'_\delta(F)$ be the number of $\delta$-mesh cubes that intersect $F$. They
obviously provide a collection of $N'_\delta(F)$ sets of diameter $\delta\sqrt{n}$ that cover $F$, so

$$N_{\delta\sqrt{n}}(F) \leq N'_\delta(F).$$

If $\delta\sqrt{n} < 1$ then

$$\frac{\log N_{\delta\sqrt{n}}(F)}{-\log(\delta\sqrt{n})} \leq \frac{\log N'_\delta(F)}{-\log\sqrt{n} - \log \delta}$$

so taking limits as $\delta \to 0$

$$\underline{\dim}_B F \leq \underline{\lim}_{\delta \to 0} \frac{\log N'_\delta(F)}{-\log \delta} \tag{3.7}$$

and

$$\overline{\dim}_B F \leq \overline{\lim}_{\delta \to 0} \frac{\log N'_\delta(F)}{-\log \delta}. \tag{3.8}$$

On the other hand, any set of diameter at most $\delta$ is contained in $3^n$ mesh cubes
of side $\delta$ (by choosing a cube containing some point of the set together with its
neighbouring cubes). Thus

$$N'_\delta(F) \leq 3^n N_\delta(F)$$

and taking logarithms and limits as $\delta \to 0$ leads to the opposite inequalities to
(3.7) and (3.8). Hence to find the box dimensions (3.4)-(3.6), we can equally
well take $N_\delta(F)$ to be the number of mesh cubes of side $\delta$ that intersect $F$.

This version of the definitions is widely used empirically. To find the box
dimension of a plane set $F$ we draw a mesh of squares or boxes of side $\delta$ and count the
number $N_\delta(F)$ that overlap the set for various small $\delta$ (hence the name 'box-counting').
The dimension is the logarithmic rate at which $N_\delta(F)$ increases as $\delta \to 0$, and may
be estimated by the gradient of the graph of $\log N_\delta(F)$ against $-\log \delta$.
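The mesh-counting procedure is straightforward to carry out on a finite sample of points. The Python sketch below is a simplified illustration under stated assumptions: it counts half-open mesh cubes $[m\delta, (m+1)\delta)$ rather than the closed cubes of the text (which can change the counts by a bounded factor but not the limiting ratio), it uses as sample the left end points of the level-10 intervals of the middle third Cantor set, and the function names are hypothetical. The counts are only meaningful for $\delta$ no smaller than the resolution of the sample.

```python
from fractions import Fraction
from math import floor, log


def cantor_left_endpoints(n):
    """Left end points of the 2**n level-n intervals of the middle third Cantor set."""
    points = [Fraction(0)]
    for level in range(1, n + 1):
        step = Fraction(2, 3 ** level)
        points = points + [p + step for p in points]
    return points


def box_count(points, delta):
    """Number of half-open delta-mesh cubes containing at least one sample point.

    Each point is a tuple of coordinates, so the same code serves any dimension n.
    """
    boxes = {tuple(floor(coord / delta) for coord in p) for p in points}
    return len(boxes)


sample = [(p,) for p in cantor_left_endpoints(10)]
for k in range(1, 7):
    delta = Fraction(1, 3 ** k)
    n_boxes = box_count(sample, delta)
    print(k, n_boxes, log(n_boxes) / (-log(float(delta))))
```

For this sample the count at $\delta = 3^{-k}$ with $k \leq 10$ is $2^k$, so each printed ratio equals $\log 2/\log 3$ up to rounding.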

This definition gives an interpretation of the meaning of box dimension. The
number of mesh cubes of side $\delta$ that intersect a set is an indication of how spread
out or irregular the set is when examined at scale $\delta$. The dimension reflects how
rapidly the irregularities develop as $\delta \to 0$.

Another frequently used definition of box dimension is obtained by taking
$N_\delta(F)$ in (3.4)-(3.6) to be the smallest number of arbitrary cubes of side $\delta$
required to cover $F$. The equivalence of this definition follows as in the mesh
cube case, noting that any cube of side $\delta$ has diameter $\delta\sqrt{n}$, and that any set of
diameter of at most $\delta$ is contained in a cube of side $\delta$.

Similarly, we get exactly the same values if in (3.4)-(3.6) we take $N_\delta(F)$ as
the smallest number of closed balls of radius $\delta$ that cover $F$.

A less obviously equivalent formulation of box dimension has the largest
number of disjoint balls of radius $\delta$ with centres in $F$. Let this number be $N'_\delta(F)$,
and let $B_1, \ldots, B_{N'_\delta(F)}$ be disjoint balls centred in $F$ and of radius $\delta$. If $x$ belongs
to $F$ then $x$ must be within distance $\delta$ of one of the $B_i$, otherwise the ball of centre
$x$ and radius $\delta$ can be added to form a larger collection of disjoint balls. Thus the
$N'_\delta(F)$ balls concentric with the $B_i$ but of radius $2\delta$ (diameter $4\delta$) cover $F$, giving

$$N_{4\delta}(F) \leq N'_\delta(F). \tag{3.9}$$

Suppose also that $B_1, \ldots, B_{N'_\delta(F)}$ are disjoint balls of radii $\delta$ with centres in $F$.
Let $U_1, \ldots, U_k$ be any collection of sets of diameter at most $\delta$ which cover $F$.
Since the $U_j$ must cover the centres of the $B_i$, each $B_i$ must contain at least one
of the $U_j$. As the $B_i$ are disjoint there are at least as many $U_j$ as $B_i$. Hence

$$N'_\delta(F) \leq N_\delta(F). \tag{3.10}$$

Taking logarithms and limits of (3.9) and (3.10) shows that the values of (3.4)-
(3.6) are unaltered if $N_\delta(F)$ is replaced by this $N'_\delta(F)$.

These various definitions are summarized below and in figure 3.2. 

Equivalent definitions 3.1 

The lower and upper box-counting dimensions of a subset $F$ of $\mathbb{R}^n$ are given by

$$\underline{\dim}_B F = \underline{\lim}_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta} \tag{3.11}$$

$$\overline{\dim}_B F = \overline{\lim}_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta} \tag{3.12}$$

and the box-counting dimension of $F$ by

$$\dim_B F = \lim_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta} \tag{3.13}$$

(if this limit exists), where $N_\delta(F)$ is any of the following:

(i) the smallest number of closed balls of radius $\delta$ that cover $F$;
(ii) the smallest number of cubes of side $\delta$ that cover $F$;
(iii) the number of $\delta$-mesh cubes that intersect $F$;
(iv) the smallest number of sets of diameter at most $\delta$ that cover $F$;
(v) the largest number of disjoint balls of radius $\delta$ with centres in $F$.

Figure 3.2 Five ways of finding the box dimension of $F$; see Equivalent definitions 3.1.
The number $N_\delta(F)$ is taken to be: (i) the least number of closed balls of radius $\delta$ that
cover $F$; (ii) the least number of cubes of side $\delta$ that cover $F$; (iii) the number of $\delta$-mesh
cubes that intersect $F$; (iv) the least number of sets of diameter at most $\delta$ that cover $F$;
(v) the greatest number of disjoint balls of radius $\delta$ with centres in $F$

This list could be extended further; in practice one adopts the definition most 
convenient for a particular application. 

It is worth noting that, in (3.11)-(3.13), it is enough to consider limits as $\delta$
tends to 0 through any decreasing sequence $\delta_k$ such that $\delta_{k+1} \geq c\delta_k$ for some
constant $0 < c < 1$; in particular for $\delta_k = c^k$. To see this, note that if
$\delta_{k+1} \leq \delta < \delta_k$, then, with $N_\delta(F)$ the least number of sets in a $\delta$-cover of $F$,

$$\frac{\log N_\delta(F)}{-\log \delta} \leq \frac{\log N_{\delta_{k+1}}(F)}{-\log \delta_k}
= \frac{\log N_{\delta_{k+1}}(F)}{-\log \delta_{k+1} + \log(\delta_{k+1}/\delta_k)}
\leq \frac{\log N_{\delta_{k+1}}(F)}{-\log \delta_{k+1} + \log c}$$

and so

$$\overline{\lim}_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta} \leq \overline{\lim}_{k \to \infty} \frac{\log N_{\delta_k}(F)}{-\log \delta_k}. \tag{3.14}$$

The opposite inequality is trivial; the case of lower limits may be dealt with in
the same way.

There is an equivalent definition of box dimension of a rather different form 
that is worth mentioning. Recall that the $\delta$-neighbourhood $F_\delta$ of a subset $F$ of
$\mathbb{R}^n$ is

$$F_\delta = \{x \in \mathbb{R}^n : |x - y| \leq \delta \text{ for some } y \in F\} \tag{3.15}$$

i.e. the set of points within distance $\delta$ of $F$. We consider the rate at which the
$n$-dimensional volume of $F_\delta$ shrinks as $\delta \to 0$. In $\mathbb{R}^3$, if $F$ is a single point
then $F_\delta$ is a ball with $\text{vol}(F_\delta) = \tfrac{4}{3}\pi\delta^3$, if $F$ is a segment of length $l$ then $F_\delta$ is
'sausage-like' with $\text{vol}(F_\delta) \sim \pi l\delta^2$, and if $F$ is a flat set of area $a$ then $F_\delta$ is
essentially a thickening of $F$ with $\text{vol}(F_\delta) \sim 2a\delta$. In each case,
$\text{vol}(F_\delta) \sim c\delta^{3-s}$ where the integer $s$ is the dimension of $F$, so that the exponent of
$\delta$ is indicative of the dimension. The coefficient $c$ of $\delta^{3-s}$, known as the Minkowski
content of $F$, is a measure of the length, area or volume of the set as appropriate.

This idea extends to fractional dimensions. If $F$ is a subset of $\mathbb{R}^n$ and, for some
$s$, $\text{vol}^n(F_\delta)/\delta^{n-s}$ tends to a positive finite limit as $\delta \to 0$ where $\text{vol}^n$ denotes
$n$-dimensional volume, then it makes sense to regard $F$ as $s$-dimensional. The
limiting value is called the $s$-dimensional content of $F$, a concept of slightly
restricted use since it is not necessarily additive on disjoint subsets, i.e. is not a
measure. Even if this limit does not exist, we may be able to extract the critical
exponent of $\delta$ and this turns out to be related to the box dimension.

Proposition 3.2 

If $F$ is a subset of $\mathbb{R}^n$, then

$$\underline{\dim}_B F = n - \overline{\lim}_{\delta \to 0} \frac{\log \text{vol}^n(F_\delta)}{\log \delta}$$

$$\overline{\dim}_B F = n - \underline{\lim}_{\delta \to 0} \frac{\log \text{vol}^n(F_\delta)}{\log \delta}$$

where $F_\delta$ is the $\delta$-neighbourhood of $F$.

Proof. If $F$ can be covered by $N_\delta(F)$ balls of radius $\delta < 1$ then $F_\delta$ can be covered
by the concentric balls of radius $2\delta$. Hence

$$\text{vol}^n(F_\delta) \leq N_\delta(F) c_n (2\delta)^n$$

where $c_n$ is the volume of the unit ball in $\mathbb{R}^n$. Taking logarithms,

$$\frac{\log \text{vol}^n(F_\delta)}{-\log \delta} \leq \frac{\log 2^n c_n + n \log \delta + \log N_\delta(F)}{-\log \delta},$$

so

$$\underline{\lim}_{\delta \to 0} \frac{\log \text{vol}^n(F_\delta)}{-\log \delta} \leq -n + \underline{\dim}_B F \tag{3.16}$$

with a similar inequality for the upper limits. On the other hand if there are
$N_\delta(F)$ disjoint balls of radius $\delta$ with centres in $F$, then by adding their volumes,

$$N_\delta(F) c_n \delta^n \leq \text{vol}^n(F_\delta).$$

Taking logarithms and letting $\delta \to 0$ gives the opposite inequality to (3.16), using
Equivalent definition 3.1(v). □

In the context of Proposition 3.2, box dimension is sometimes referred to as 
Minkowski dimension or Minkowski-Bouligand dimension. 
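Proposition 3.2 can be illustrated in the simplest case $n = 1$. The Python sketch below (function names hypothetical) computes the length of the $\delta$-neighbourhood of the union of level-$n$ intervals of the middle third Cantor set. For $\delta \geq 3^{-n}/2$ this coincides with $F_\delta$, because the end points of the level-$n$ intervals belong to $F$ and their $\delta$-neighbourhoods already cover each level-$n$ interval. The quantity $1 - \log \text{vol}^1(F_\delta)/\log \delta$ is then printed for $\delta = 3^{-k}$.

```python
from fractions import Fraction
from math import log


def cantor_intervals(n):
    """Closed level-n intervals [a, b] of the middle third Cantor set, as Fractions."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(n):
        refined = []
        for a, b in intervals:
            third = (b - a) / 3
            refined.append((a, a + third))
            refined.append((b - third, b))
        intervals = refined
    return intervals


def neighbourhood_length(intervals, delta):
    """Length of the closed delta-neighbourhood of a union of intervals on the line."""
    total = Fraction(0)
    start = end = None
    for a, b in sorted(intervals):
        lo, hi = a - delta, b + delta
        if end is None or lo > end:      # a new connected piece begins
            if end is not None:
                total += end - start
            start, end = lo, hi
        else:                            # overlaps or touches the current piece
            end = max(end, hi)
    if end is not None:
        total += end - start
    return total


level_n = cantor_intervals(12)
for k in (2, 4, 8, 12):
    delta = Fraction(1, 3 ** k)
    vol = neighbourhood_length(level_n, delta)
    print(k, float(vol), 1 - log(float(vol)) / log(float(delta)))
```

A direct count of the gaps that are filled in gives $\text{vol}^1(F_\delta) = \tfrac{5}{2}(\tfrac{2}{3})^k$ at $\delta = 3^{-k}$, so the printed estimates approach $\log 2/\log 3$ only at rate $1/k$; the slow convergence is a feature of the ratio in the proposition rather than of the code.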

It is important to understand the relationship between box-counting dimension 
and Hausdorff dimension. If $F$ can be covered by $N_\delta(F)$ sets of diameter $\delta$, then,
from definition (2.1),

$$\mathcal{H}^s_\delta(F) \leq N_\delta(F)\delta^s.$$

If $1 < \mathcal{H}^s(F) = \lim_{\delta \to 0} \mathcal{H}^s_\delta(F)$ then $\log N_\delta(F) + s \log \delta > 0$ if $\delta$ is
sufficiently small. Thus $s \leq \underline{\lim}_{\delta \to 0} \log N_\delta(F)/(-\log \delta)$ so

$$\dim_H F \leq \underline{\dim}_B F \leq \overline{\dim}_B F \tag{3.17}$$

for every $F \subset \mathbb{R}^n$. We do not in general get equality here. Although Hausdorff
and box dimensions are equal for many 'reasonably regular' sets, there are plenty
of examples where this inequality is strict.

Roughly speaking (3.6) says that $N_\delta(F) \simeq \delta^{-s}$ for small $\delta$, where $s = \dim_B F$.
More precisely, it says that

\[ N_\delta(F)\delta^s \to \infty \quad \text{if } s < \dim_B F \]

and

\[ N_\delta(F)\delta^s \to 0 \quad \text{if } s > \dim_B F. \]

But

\[ N_\delta(F)\delta^s = \inf\Big\{ \sum_i \delta^s : \{U_i\} \text{ is a (finite) } \delta\text{-cover of } F \Big\}, \]

which should be compared with

\[ \mathcal{H}^s_\delta(F) = \inf\Big\{ \sum_i |U_i|^s : \{U_i\} \text{ is a } \delta\text{-cover of } F \Big\}, \]

which occurs in the definitions of Hausdorff measure and dimension. In calculating
Hausdorff dimension, we assign different weights $|U_i|^s$ to the covering sets
$U_i$, whereas for the box dimensions we use the same weight $\delta^s$ for each covering
set. Box dimensions may be thought of as indicating the efficiency with which 
a set may be covered by small sets of equal size, whereas Hausdorff dimension 
involves coverings by sets of small but perhaps widely varying size. 

There is a temptation to introduce the quantity $\nu(F) = \underline{\lim}_{\delta \to 0} N_\delta(F)\delta^s$, but
this does not give a measure on subsets of $\mathbb{R}^n$. As we shall see, one consequence
of this is that box dimensions have a number of unfortunate properties, and can 
be awkward to handle mathematically. 

Since box dimensions are determined by coverings by sets of equal size they 
tend to be easier to calculate than Hausdorff dimensions. Just as with Hausdorff 
dimension, calculations of box dimension usually involve finding a lower bound 
and an upper bound separately, each bound depending on a geometric observation 
followed by an algebraic estimate. 

Example 3.3 

Let F be the middle third Cantor set (figure 0.1). Then $\underline{\dim}_B F = \overline{\dim}_B F = \log 2/\log 3$.

Calculation. The obvious covering by the $2^k$ level-k intervals of $E_k$ of length $3^{-k}$
gives that $N_\delta(F) \le 2^k$ if $3^{-k} < \delta \le 3^{-k+1}$. From (3.5)

\[ \overline{\dim}_B F = \overline{\lim}_{\delta \to 0} \frac{\log N_\delta(F)}{-\log \delta} \le \overline{\lim}_{k \to \infty} \frac{\log 2^k}{\log 3^{k-1}} = \frac{\log 2}{\log 3}. \]

On the other hand, any interval of length $\delta$ with $3^{-k-1} \le \delta < 3^{-k}$ intersects at
most one of the level-k intervals of length $3^{-k}$ used in the construction of F.
There are $2^k$ such intervals so at least $2^k$ intervals of length $\delta$ are required to
cover F. Hence $N_\delta(F) \ge 2^k$ leading to $\underline{\dim}_B F \ge \log 2/\log 3$. □

Thus, at least for the Cantor set, $\dim_H F = \dim_B F$.
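
The counting in Example 3.3 can be reproduced with exact integer arithmetic. The sketch below is illustrative only: the helper names are hypothetical, the Cantor set is represented by the left endpoints of the level-`level` intervals (scaled by $3^{\text{level}}$ so that they are integers whose base-3 digits are 0 or 2), and boxes are taken to be half-open mesh intervals $[j3^{-k}, (j+1)3^{-k})$, a convention that can change the count by a bounded factor but not the dimension.

```python
import math
from itertools import product

def cantor_left_endpoints(level):
    """Left endpoints of the 2**level intervals of E_level, multiplied by 3**level."""
    points = []
    for digits in product((0, 2), repeat=level):
        value = 0
        for d in digits:
            value = 3 * value + d     # build the base-3 integer digit by digit
        points.append(value)
    return points

def mesh_count(points, level, k):
    """Number of half-open mesh intervals of side 3**-k that contain a point."""
    scale = 3 ** (level - k)
    return len({p // scale for p in points})

def box_dimension_estimate(level, k):
    """log N / (-log delta) with delta = 3**-k."""
    n = mesh_count(cantor_left_endpoints(level), level, k)
    return math.log(n) / (k * math.log(3))
```

For mesh side $3^{-k}$ with $k \le$ `level` the count is $2^k$, so the ratio $\log N_\delta / (-\log \delta)$ equals $\log 2/\log 3$ at each of these scales.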



3.2 Properties and problems of box-counting dimension 

The following elementary properties of box dimension mirror those of Hausdorff 
dimension, and may be verified in much the same way. 
(i) A smooth m-dimensional submanifold of $\mathbb{R}^n$ has $\dim_B F = m$.
(ii) $\underline{\dim}_B$ and $\overline{\dim}_B$ are monotonic.
(iii) $\overline{\dim}_B$ is finitely stable, i.e.

\[ \overline{\dim}_B(E \cup F) = \max\{\overline{\dim}_B E, \overline{\dim}_B F\}; \]

the corresponding identity does not hold for $\underline{\dim}_B$.
(iv) $\underline{\dim}_B$ and $\overline{\dim}_B$ are bi-Lipschitz invariant. This is so because, if
$|f(x) - f(y)| \le c|x - y|$ and F can be covered by $N_\delta(F)$ sets of diameter at most
$\delta$, then the $N_\delta(F)$ images of these sets under f form a cover of f(F)
by sets of diameter at most $c\delta$, thus $\dim_B f(F) \le \dim_B F$. Similarly, box
dimensions behave just like Hausdorff dimensions under bi-Lipschitz and
Hölder transformations.

We now start to encounter the disadvantages of box-counting dimension. The 
next proposition is at first appealing, but has undesirable consequences. 

Proposition 3.4 

Let $\overline{F}$ denote the closure of F (i.e. the smallest closed subset of $\mathbb{R}^n$ containing
F). Then

\[ \underline{\dim}_B \overline{F} = \underline{\dim}_B F \]

and

\[ \overline{\dim}_B \overline{F} = \overline{\dim}_B F. \]

Proof. Let $B_1, \ldots, B_k$ be a finite collection of closed balls of radii $\delta$. If the closed
set $\bigcup_{i=1}^k B_i$ contains F, it also contains $\overline{F}$. Hence the smallest number of closed
balls of radius $\delta$ that cover F equals the smallest number required to cover the
larger set $\overline{F}$. The result follows. □

An immediate consequence of this is that if F is a dense subset of an open
region of $\mathbb{R}^n$ then $\underline{\dim}_B F = \overline{\dim}_B F = n$. For example, let F be the (countable) set
of rational numbers between 0 and 1. Then $\overline{F}$ is the entire interval [0, 1], so that
$\underline{\dim}_B F = \overline{\dim}_B F = 1$. Thus countable sets can have non-zero box dimension.
Moreover, the box-counting dimension of each rational number regarded as a
one-point set is clearly zero, but the countable union of these singleton sets
has dimension 1. Consequently, it is not generally true that
$\dim_B \bigcup_{i=1}^{\infty} F_i = \sup_i \dim_B F_i$.

This severely limits the usefulness of box dimension — introducing a small, i.e. 
countable, set of points can play havoc with the dimension. We might hope to sal- 
vage something by restricting attention to closed sets, but difficulties still remain. 

Example 3.5 

$F = \{0, 1, \frac{1}{2}, \frac{1}{3}, \ldots\}$ is a compact set with $\dim_B F = \frac{1}{2}$.

Calculation. Let $0 < \delta < \frac{1}{2}$ and let k be the integer satisfying
$1/(k-1)k > \delta \ge 1/k(k+1)$. If $|U| \le \delta$, then U can cover at most one of the points
$\{1, \frac{1}{2}, \ldots, \frac{1}{k}\}$ since $1/(k-1) - 1/k = 1/(k-1)k > \delta$. Thus at least k sets of
diameter $\delta$ are required to cover F, so $N_\delta(F) \ge k$ giving

\[ \frac{\log N_\delta(F)}{-\log \delta} \ge \frac{\log k}{\log k(k+1)}. \]

Letting $\delta \to 0$ so $k \to \infty$ gives $\underline{\dim}_B F \ge \frac{1}{2}$. On the other hand, if $\frac{1}{2} > \delta > 0$,
take k such that $1/(k-1)k > \delta \ge 1/k(k+1)$. Then (k + 1) intervals of length
$\delta$ cover [0, 1/k], leaving k − 1 points of F which can be covered by another
k − 1 intervals. Thus $N_\delta(F) \le 2k$, so

\[ \frac{\log N_\delta(F)}{-\log \delta} \le \frac{\log(2k)}{\log k(k-1)} \]

giving

\[ \overline{\dim}_B F \le \tfrac{1}{2}. \] □

No-one would regard this set, with all but one of its points isolated, as a fractal, 
yet it has large box dimension. 
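
The bounds $k \le N_\delta(F) \le 2k$ in Example 3.5 can be checked numerically. The following is a minimal sketch, not part of the original calculation: the function name is hypothetical, covering sets are taken to be intervals of length $\delta$ (any set of diameter $\delta$ in $\mathbb{R}$ lies in such an interval), and the least number of intervals is found by the usual left-to-right greedy sweep, which is optimal for points on a line.

```python
import math

def min_cover_count(delta):
    """Least number of intervals of length delta covering {0, 1, 1/2, 1/3, ...}."""
    # Every point 1/j with j >= j_max lies in [0, delta], so the interval
    # placed at 0 covers the whole infinite tail of the set.
    j_max = math.ceil(1.0 / delta) + 1
    points = [0.0] + [1.0 / j for j in range(j_max, 0, -1)]  # increasing order
    count = 0
    right_end = -1.0                  # right end of the last interval placed
    for p in points:
        if p > right_end:             # p is not yet covered: start a new interval at p
            count += 1
            right_end = p + delta
    return count

def dimension_ratio(delta):
    """The ratio log N_delta(F) / (-log delta) at a single scale."""
    return math.log(min_cover_count(delta)) / (-math.log(delta))
```

With $\delta = 10^{-4}$ the integer k of the example is 100, so the count should lie between 100 and 200. The ratio approaches $\frac{1}{2}$ only slowly as $\delta$ decreases.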

Nevertheless, as well as being convenient in practice, box dimensions are very 
useful in theory. If, as often happens, it can be shown that a set has equal box 
and Hausdorff dimensions, the interplay between these definitions can be used 
to powerful effect. 

*3.3 Modified box-counting dimensions 

There are ways of overcoming the difficulties of box dimension outlined in the 
last section. However, they may not at first seem appealing since they re-introduce 
all the difficulties of calculation associated with Hausdorff dimension and more. 
For F a subset of $\mathbb{R}^n$ we can try to decompose F into a countable number of
pieces $F_1, F_2, \ldots$ in such a way that the largest piece has as small a dimension
as possible. This idea leads to the following modified box-counting dimensions:

\[ \underline{\dim}_{MB} F = \inf\Big\{ \sup_i \underline{\dim}_B F_i : F \subset \bigcup_{i=1}^{\infty} F_i \Big\} \tag{3.18} \]

\[ \overline{\dim}_{MB} F = \inf\Big\{ \sup_i \overline{\dim}_B F_i : F \subset \bigcup_{i=1}^{\infty} F_i \Big\}. \tag{3.19} \]

(In both cases the infimum is over all possible countable covers $\{F_i\}$ of F.)
Clearly $\underline{\dim}_{MB} F \le \underline{\dim}_B F$ and $\overline{\dim}_{MB} F \le \overline{\dim}_B F$. However, we now have that
$\underline{\dim}_{MB} F = \overline{\dim}_{MB} F = 0$ if F is countable: just take the $F_i$ to be one-point sets.
Moreover, for any subset F of $\mathbb{R}^n$,

\[ 0 \le \dim_H F \le \underline{\dim}_{MB} F \le \overline{\dim}_{MB} F \le \overline{\dim}_B F \le n. \tag{3.20} \]

It is easy to see that $\underline{\dim}_{MB}$ and $\overline{\dim}_{MB}$ recover all the desirable properties of
a dimension, but they can be hard to calculate. However, there is a useful test
for compact sets to have equal box and modified box dimensions. It applies to
sets that might be described as 'dimensionally homogeneous'.

Proposition 3.6 

Let $F \subset \mathbb{R}^n$ be compact. Suppose that

\[ \overline{\dim}_B(F \cap V) = \overline{\dim}_B F \tag{3.21} \]

for all open sets V that intersect F. Then $\overline{\dim}_B F = \overline{\dim}_{MB} F$. A similar result
holds for lower box-counting dimensions.

Proof. Let $F \subset \bigcup_{i=1}^{\infty} F_i$ with each $F_i$ closed. A version of Baire's category theorem
(which may be found in any text on basic general topology, and which we
quote without proof) states that there is an index i and an open set $V \subset \mathbb{R}^n$ such
that $F \cap V \subset F_i$. For this i, $\overline{\dim}_B F_i = \overline{\dim}_B F$. Using (3.19) and Proposition 3.4

\[ \overline{\dim}_{MB} F = \inf\Big\{ \sup_i \overline{\dim}_B F_i : F \subset \bigcup_{i=1}^{\infty} F_i \text{ where the } F_i \text{ are closed sets} \Big\} \ge \overline{\dim}_B F. \]

The opposite inequality is contained in (3.20). A similar argument deals with 
the lower dimensions. □ 

For an application, let F be a compact set with a high degree of self-similarity,
for instance the middle third Cantor set or von Koch curve. If V is any open set
that intersects F, then $F \cap V$ contains a geometrically similar copy of F which
must have upper box dimension equal to that of F, so that (3.21) holds, leading
to equal box and modified box dimensions.



*3.4 Packing measures and dimensions 

Unlike Hausdorff dimension, neither the box dimensions nor the modified box
dimensions are defined in terms of measures, and this can present difficulties in their
theoretical development. Nevertheless, the circle of ideas in the last section may
be completed in a way that is, at least mathematically, elegant. Recall that Hausdorff
dimension may be defined using economical coverings by small balls (2.16)
whilst $\overline{\dim}_B$ may be defined using economical coverings by small balls of equal
radius (Equivalent definition 3.1(i)). On the other hand $\overline{\dim}_B$ may be thought of
as a dimension that depends on packings by disjoint balls of equal radius that
are as dense as possible (Equivalent definition 3.1(v)). Coverings and packings
play a dual role in many areas of mathematics and it is therefore natural to
look for a dimension that is defined in terms of dense packings by disjoint balls
of differing small radii.

We try to follow the pattern of definition of Hausdorff measure and dimension. 
For $s \ge 0$ and $\delta > 0$, let

\[ \mathcal{P}^s_\delta(F) = \sup\Big\{ \sum_i |B_i|^s : \{B_i\} \text{ is a collection of disjoint balls of radii at most } \delta \text{ with centres in } F \Big\}. \tag{3.22} \]

Since $\mathcal{P}^s_\delta(F)$ decreases with $\delta$, the limit

\[ \mathcal{P}^s_0(F) = \lim_{\delta \to 0} \mathcal{P}^s_\delta(F) \tag{3.23} \]

exists. At this point we meet the problems encountered with box-counting dimensions.
By considering countable dense sets it is easy to see that $\mathcal{P}^s_0(F)$ is not a
measure. Hence we modify the definition to

\[ \mathcal{P}^s(F) = \inf\Big\{ \sum_i \mathcal{P}^s_0(F_i) : F \subset \bigcup_{i=1}^{\infty} F_i \Big\}. \tag{3.24} \]

It may be shown that $\mathcal{P}^s(F)$ is a measure on $\mathbb{R}^n$, known as the s-dimensional
packing measure. We may define the packing dimension in the natural way:

\[ \dim_P F = \sup\{s : \mathcal{P}^s(F) = \infty\} = \inf\{s : \mathcal{P}^s(F) = 0\}. \tag{3.25} \]

The underlying measure structure immediately implies monotonicity: that
$\dim_P E \le \dim_P F$ if $E \subset F$. Moreover, for a countable collection of sets $\{F_i\}$,

\[ \dim_P\Big(\bigcup_{i=1}^{\infty} F_i\Big) = \sup_i \dim_P F_i, \tag{3.26} \]

since if $s > \dim_P F_i$ for all i, then $\mathcal{P}^s(\bigcup_i F_i) \le \sum_i \mathcal{P}^s(F_i) = 0$ implying
$\dim_P(\bigcup_i F_i) \le s$.

We now investigate the relationship of packing dimension with other definitions 
of dimension and verify the surprising fact that packing dimension is just the same 
as the modified upper box dimension. 
Lemma 3.7

\[ \dim_P F \le \overline{\dim}_B F. \tag{3.27} \]

Proof. If $\dim_P F = 0$, the result is obvious. Otherwise choose any t and s with
$0 < t < s < \dim_P F$. Then $\mathcal{P}^s(F) = \infty$, so $\mathcal{P}^s_0(F) = \infty$. Thus, given $0 < \delta < 1$,
there are disjoint balls $\{B_i\}$, of radii at most $\delta$ with centres in F, such that
$1 < \sum_{i=1}^{\infty} |B_i|^s$. Suppose that, for each k, exactly $n_k$ of these balls satisfy
$2^{-k-1} < |B_i| \le 2^{-k}$; then

\[ 1 < \sum_{k=0}^{\infty} n_k 2^{-ks}. \tag{3.28} \]

There must be some k with $n_k > 2^{kt}(1 - 2^{t-s})$, otherwise the sum in (3.28) is
at most $\sum_{k=0}^{\infty} 2^{kt-ks}(1 - 2^{t-s}) = 1$, by summing the geometric series. These $n_k$
balls all contain balls of radii $2^{-k-2} \le \delta$ centred in F. Hence if $N_\delta(F)$ denotes
the greatest number of disjoint balls of radius $\delta$ with centres in F, then

\[ N_{2^{-k-2}}(F)(2^{-k-2})^t \ge n_k (2^{-k-2})^t > 2^{-2t}(1 - 2^{t-s}) \]

where $2^{-k-2} < \delta$. It follows that $\overline{\lim}_{\delta \to 0} N_\delta(F)\delta^t > 0$, so that $\overline{\dim}_B F \ge t$
using Equivalent definition 3.1(v). This is true for any $0 < t < \dim_P F$ so
(3.27) follows. □

Proposition 3.8 

If $F \subset \mathbb{R}^n$ then $\dim_P F = \overline{\dim}_{MB} F$.

Proof. If $F \subset \bigcup_{i=1}^{\infty} F_i$ then, by (3.26) and (3.27),

\[ \dim_P F \le \sup_i \dim_P F_i \le \sup_i \overline{\dim}_B F_i. \]

Definition (3.19) now gives that $\dim_P F \le \overline{\dim}_{MB} F$.

Conversely, if $s > \dim_P F$ then $\mathcal{P}^s(F) = 0$, so that $F \subset \bigcup_i F_i$ for a collection
of sets $F_i$ with $\mathcal{P}^s_0(F_i) < \infty$ for each i, by (3.24). Hence, for each i, if $\delta$ is small
enough, then $\mathcal{P}^s_\delta(F_i) < \infty$, so by (3.22) $N_\delta(F_i)\delta^s$ is bounded as $\delta \to 0$, where
$N_\delta(F_i)$ is the largest number of disjoint balls of radius $\delta$ with centres in $F_i$. By
Equivalent definition 3.1(v) $\overline{\dim}_B F_i \le s$ for each i, giving that $\overline{\dim}_{MB} F \le s$ by
(3.19), as required. □

We have established the following relations: 

\[ \dim_H F \le \underline{\dim}_{MB} F \le \overline{\dim}_{MB} F = \dim_P F \le \overline{\dim}_B F. \tag{3.29} \]

Suitable examples show that none of the inequalities can be replaced by equality. 
As with Hausdorff dimension, packing dimension permits the use of powerful
measure theoretic techniques in its study. The introduction of packing measures 
(remarkably some 60 years after Hausdorff measures) has led to a greater under- 
standing of the geometric measure theory of fractals, with packing measures 
behaving in a way that is 'dual' to Hausdorff measures in many respects. Indeed 
corresponding results for Hausdorff and packing measures are often presented 
side by side. Nevertheless, one cannot pretend that packing measures and dimen- 
sions are easy to work with or to calculate; the extra step (3.24) in their definition 
makes them more awkward to use than the Hausdorff analogues. 

This situation is improved slightly by the equality of packing dimension and 
the modified upper box dimension. It is improved considerably for compact sets 
with 'local' dimension constant throughout — a situation that occurs frequently 
in practice, in particular in sets with some kind of self-similarity.

Corollary 3.9 

Let $F \subset \mathbb{R}^n$ be compact and such that

\[ \overline{\dim}_B(F \cap V) = \overline{\dim}_B F \tag{3.30} \]

for all open sets V that intersect F. Then $\dim_P F = \overline{\dim}_B F$.

Proof. This is immediate from Propositions 3.6 and 3.8. □ 

The nicest case, of course, is of fractals with equal Hausdorff and upper box 
dimensions, in which case equality holds throughout (3.29) — we shall see many 
such examples later on. However, even the much weaker condition $\dim_H F = \dim_P F$,
though sometimes hard to prove, eases analysis of F.

3.5 Some other definitions of dimension 

A wide variety of other definitions of dimension have been introduced, many of 
them only of limited applicability, but nonetheless useful in their context. 

The special form of curves gives rise to several definitions of dimension.
We define a curve or Jordan curve C to be the image of an interval [a, b] under a
continuous bijection $f : [a, b] \to \mathbb{R}^n$. (Thus, we restrict attention to curves that
are non-self-intersecting.) If C is a curve and $\delta > 0$, we define $M_\delta(C)$ to be the
maximum number of points $x_0, x_1, \ldots, x_m$, on the curve C, in that order, such
that $|x_k - x_{k-1}| = \delta$ for k = 1, 2, ..., m. Thus $(M_\delta(C) - 1)\delta$ may be thought of
as the 'length' of the curve C measured using a pair of dividers with points set
at a distance $\delta$ apart. The divider dimension is defined as

\[ \lim_{\delta \to 0} \frac{\log M_\delta(C)}{-\log \delta} \tag{3.31} \]

assuming the limit exists (otherwise we may define upper and lower divider
dimensions using upper and lower limits). It is easy to see that the divider dimen- 
sion of a curve is at least equal to the box dimension (assuming that they both 
exist) and in simple self-similar examples, such as the von Koch curve, they 
are equal. The assertion that the coastline of Britain has dimension 1.2 is usu- 
ally made with the divider dimension in mind — this empirical value comes from 
estimating the ratio in (3.31) for values of $\delta$ between about 20 m and 200 km.
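
The divider procedure can be imitated on a polygonal approximation to the von Koch curve. The sketch below is a simplification under stated assumptions, not the definition itself: the names are hypothetical, the curve is replaced by its level-`level` polygon, the dividers step from vertex to vertex (taking the first later vertex at distance at least $\delta$, up to a small tolerance), and the walk produces one admissible sequence of points rather than a proven maximum. For $\delta = 3^{-k}$ with k below the polygon level, the walk lands exactly on the vertices of the level-k polygon.

```python
import cmath
import math

def koch_points(level):
    """Vertices of the level-th polygonal approximation, as complex numbers."""
    pts = [0j, 1 + 0j]
    rot = cmath.exp(1j * math.pi / 3)        # rotation by 60 degrees
    for _ in range(level):
        new = [pts[0]]
        for a, b in zip(pts, pts[1:]):
            d = (b - a) / 3
            # replace segment a-b by four segments with a triangular bump
            new.extend([a + d, a + d + d * rot, a + 2 * d, b])
        pts = new
    return pts

def divider_steps(pts, delta, tol=1e-9):
    """Number of divider points found by stepping through the vertices in order."""
    count = 1
    current = pts[0]
    for p in pts[1:]:
        if abs(p - current) >= delta * (1 - tol):
            count += 1
            current = p
    return count

def divider_ratio(level, k):
    """The ratio log M / (-log delta) in (3.31) at delta = 3**-k."""
    m = divider_steps(koch_points(level), 3.0 ** -k)
    return math.log(m) / (k * math.log(3))
```

With `level = 6` and `k = 3` the walk visits $4^3 + 1 = 65$ points, and the ratio is already close to $\log 4/\log 3$.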

A variant of Hausdorff dimension may be defined for curves by using intervals 
of the curves themselves as covering sets. Thus we look at $\inf\{\sum_{i=1}^m |f([t_{i-1}, t_i])|^s\}$
where the infimum is over all dissections $a = t_0 < t_1 < \cdots < t_m = b$ such that
the diameters $|f([t_{i-1}, t_i])|$ are all at most $\delta$. We let $\delta$ tend to 0 and deem the value
of s at which this limit jumps from $\infty$ to 0 to be the dimension. For self-similar
examples such as the von Koch curve, this equals the Hausdorff dimension, but 
for 'squeezed' curves, such as graphs of certain functions (see Chapter 11) we 
may get a somewhat larger value. 

Sometimes, we are interested in the dimension of a fractal F that is the bound- 
ary of a set A. We can define the box dimension of F in the usual way, but 
sometimes it is useful to take special account of the distinction between A and 
its complement. Thus the following variation of the 's-dimensional content' definition
of box dimension, in which we take the volume of the set of points
within distance $\delta$ of F that are contained in A, is sometimes useful. We define
the one-sided dimension of the boundary F of a set A in $\mathbb{R}^n$ as

\[ n - \lim_{\delta \to 0} \frac{\log \mathrm{vol}^n(F_\delta \cap A)}{\log \delta} \tag{3.32} \]

where $F_\delta$ is the $\delta$-neighbourhood of F (compare Proposition 3.2). This definition
has applications to the surface physics of solids where it is the volume very 
close to the surface that is important and also to partial differential equations in 
domains with fractal boundaries. 

It is sometimes possible to define dimension in terms of the complement of a 
set. Suppose F is obtained by removal of a sequence of intervals $I_1, I_2, \ldots$ from,
say, the unit interval [0, 1], as, for example, in the Cantor set construction. We
may define a dimension as the number $s_0$ such that the series

\[ \sum_{j=1}^{\infty} |I_j|^s \text{ converges if } s > s_0 \text{ and diverges if } s < s_0; \tag{3.33} \]

the number $s_0$ is called the critical exponent of the series. For the middle third
Cantor set, this series is $\sum_{k=1}^{\infty} 2^{k-1} 3^{-ks}$, giving $s_0 = \log 2/\log 3$, equal to the
Hausdorff and box dimensions in this case. In general, $s_0$ equals the upper box
dimension of F.
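
The critical exponent in (3.33) is easy to locate numerically when the series is geometric. The sketch below is illustrative, with hypothetical function names, and assumes a 'middle $\lambda$' Cantor set: at stage k there are $2^{k-1}$ removed intervals of length $\lambda((1-\lambda)/2)^{k-1}$, so $\sum_j |I_j|^s$ is a geometric series with common ratio $2((1-\lambda)/2)^s$, which converges exactly when that ratio is below 1.

```python
import math

def removed_series_ratio(lam, s):
    """Common ratio of the geometric series sum |I_j|^s for the middle-lam Cantor set."""
    return 2.0 * ((1.0 - lam) / 2.0) ** s

def critical_exponent(lam, tol=1e-12):
    """Bisect for the s at which the series switches from divergent to convergent."""
    lo, hi = 0.0, 1.0                 # ratio is 2 at s = 0 and 1 - lam < 1 at s = 1
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if removed_series_ratio(lam, mid) < 1.0:
            hi = mid                  # converges at mid, so s_0 <= mid
        else:
            lo = mid                  # diverges at mid, so s_0 >= mid
    return (lo + hi) / 2
```

For $\lambda = \frac{1}{3}$ the bisection returns a value agreeing with $\log 2/\log 3$, and in general with $\log 2/\log(2/(1-\lambda))$.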

Dimension prints provide an interesting variation on Hausdorff dimension of 
a rather different nature. Dimension prints may be thought of as a sort of 'fin- 
gerprint' that enables sets with differing characteristics to be distinguished, even 
though they may have the same Hausdorff dimension. In particular they reflect
non-isotropic features of a set. 

We restrict attention to subsets of the plane, in which case the dimension print 
will also be planar. The definition of dimension prints is very similar to that 
of Hausdorff dimension but coverings by rectangles are used with side lengths 
replacing diameters. Let U be a rectangle (the sides need not be parallel to the
coordinate axes) and let $a(U) \ge b(U)$ be the lengths of the sides of U. Let s, t
be non-negative numbers. For F a subset of $\mathbb{R}^2$, let

\[ \mathcal{H}^{s,t}_\delta(F) = \inf\Big\{ \sum_i a(U_i)^s b(U_i)^t : \{U_i\} \text{ is a } \delta\text{-cover of } F \text{ by rectangles} \Big\}. \]

In the usual way, we get measures of 'Hausdorff type', $\mathcal{H}^{s,t}$, by letting $\delta \to 0$:

\[ \mathcal{H}^{s,t}(F) = \lim_{\delta \to 0} \mathcal{H}^{s,t}_\delta(F). \]

(Note that $\mathcal{H}^{s,0}$ is just a minor variant of s-dimensional Hausdorff measure where
only rectangles are allowed in the $\delta$-covers.) The dimension print, print F, of F
is defined to be the set of non-negative pairs (s, t) for which $\mathcal{H}^{s,t}(F) > 0$.

Using standard properties of measures, it is easy to see that we have mono- 
tonicity 

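\[ \operatorname{print} F_1 \subset \operatorname{print} F_2 \quad \text{if } F_1 \subset F_2 \tag{3.34} \]

and countable stability

\[ \operatorname{print}\Big(\bigcup_{i=1}^{\infty} F_i\Big) = \bigcup_{i=1}^{\infty} \operatorname{print} F_i. \tag{3.35} \]

Moreover, if (s, t) is a point in print F and (s', t') satisfies

\[ s' + t' \le s + t, \qquad t' \le t \tag{3.36} \]

then (s', t') is also in print F.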

Unfortunately, dimension prints are not particularly easy to calculate. We dis- 
play a few known examples in figure 3.3. Notice that the Hausdorff dimension 
of a set is given by the point where the edge of its print intersects the x-axis. 

Dimension prints are a useful and appealing extension of the idea of Haus- 
dorff dimension. Notice how the prints in the last two cases distinguish between 
two sets of Hausdorff (or box) dimension \j, one of which is dust-like, the 
other stratified. 

One disadvantage of dimension prints defined in this way is that they are not
Lipschitz invariants. The straight line segment and smooth convex curve are
bi-Lipschitz equivalent, but their prints are different. In the latter case the dimension
print takes into account the curvature. It would be possible to avoid this difficulty
by redefining print F as the set of (s, t) such that $\mathcal{H}^{s,t}(F') > 0$ for all bi-Lipschitz
images F' of F. This would restore Lipschitz invariance of the prints, but would
add further complications to their calculation.

Figure 3.3 A selection of dimension prints of plane sets

Of course, it would be possible to define dimension prints by analogy with box
dimensions rather than Hausdorff dimensions, using covers by equal rectangles.
Calculations still seem awkward.



3.6 Notes and references 

Many different definitions of 'fractal dimension' are scattered throughout the 
mathematical literature. The origin of box dimension seems hard to trace — it 
seems certain that it must have been considered by the pioneers of Hausdorff 
measure and dimension, and was probably rejected as being less satisfactory 
from a mathematical viewpoint. Bouligand adapted the Minkowski content to 
non-integral dimensions in 1928, and the more usual definition of box dimension 
was given by Pontrjagin and Schnirelman in 1932. 

Packing measures and dimensions are much more recent, introduced by Tricot 
(1982). Their similarities and contrasts to Hausdorff measures and dimensions 
have proved an important theoretical tool. Packing measures and box and packing 
dimensions are discussed in Mattila (1995) and Edgar (1998). Dimensions of 
curves are considered by Tricot (1995). 

Dimension prints are an innovation of Rogers (1988, 1998). 

Exercises 

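3.1 Let $f : F \to \mathbb{R}^n$ be a Lipschitz function. Show that $\underline{\dim}_B f(F) \le \underline{\dim}_B F$ and
$\overline{\dim}_B f(F) \le \overline{\dim}_B F$. More generally, show that if f satisfies a Hölder condition
$|f(x) - f(y)| \le c|x - y|^\alpha$ where c > 0 and $0 < \alpha \le 1$ then
$\underline{\dim}_B f(F) \le \frac{1}{\alpha}\underline{\dim}_B F$.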

3.2 Verify directly from the definitions that Equivalent definitions 3.1(i) and (iii) give 
the same values for box dimension. 

3.3 Let F consist of those numbers in [0, 1] whose decimal expansions do not contain 
the digit 5. Find $\dim_B F$, showing that this box dimension exists.

3.4 Verify that the Cantor dust depicted in figure 0.4 has box dimension 1 (take $E_0$ to
have side length 1).

3.5 Use Equivalent definition 3.1(iv) to check that the upper box dimension of the von 
Koch curve is at most log 4/ log 3 and 3.1(v) to check that the lower box dimension 
is at least this value. 

3.6 Use convenient parts of Equivalent definition 3.1 to find the box dimension of the 
Sierpiński triangle in figure 0.3.

3.7 Let F be the middle third Cantor set. For $0 < \delta < 1$, find the length of the
$\delta$-neighbourhood $F_\delta$ of F, and hence find the box dimension of F using
Proposition 3.2.

3.8 Construct a set F for which $\underline{\dim}_B F < \overline{\dim}_B F$. (Hint: let $k_n = 10^n$, and adapt the
Cantor set construction by deleting, at the kth stage, the middle $\frac{1}{3}$ of intervals if
$k_{2n} < k \le k_{2n+1}$, but the middle $\frac{3}{5}$ of intervals if $k_{2n-1} < k \le k_{2n}$.)

3.9 Verify that $\overline{\dim}_B(E \cup F) = \max\{\overline{\dim}_B E, \overline{\dim}_B F\}$ for bounded $E, F \subset \mathbb{R}$.

3.10 Find subsets E and F of $\mathbb{R}$ such that $\underline{\dim}_B(E \cup F) > \max\{\underline{\dim}_B E, \underline{\dim}_B F\}$. (Hint:
consider two sets of the form indicated in Exercise 3.8.)

3.11 What are the Hausdorff and box dimensions of the set $\{0, 1, \frac{1}{4}, \frac{1}{9}, \frac{1}{16}, \ldots\}$?

3.12 Find two disjoint Borel subsets E and F of $\mathbb{R}$ such that
$\mathcal{P}^s_0(E \cup F) \ne \mathcal{P}^s_0(E) + \mathcal{P}^s_0(F)$.

3.13 What is the packing dimension of the von Koch curve? 

3.14 Find the divider dimension (3.31) of the von Koch curve. 

3.15 Show that the divider dimension (3.31) of a curve is greater than or equal to its box 
dimension, assuming that they both exist. 

3.16 Let $0 < \lambda < 1$ and let F be the 'middle $\lambda$ Cantor set' obtained by repeated removal
of the middle proportion $\lambda$ from intervals. Show that the dimension of F defined
by (3.33) in terms of removed intervals equals the Hausdorff and box dimensions 
of F. 

3.17 Verify properties (3.34)-(3.36) of dimension prints. Give an example of a set with
a non-convex dimension print.



Chapter 4 Techniques for calculating 
dimensions 



A direct attempt at calculating the dimensions, in particular the Hausdorff dimen- 
sion, of almost any set will convince the reader of the practical limitations of 
working from the definitions. Rigorous dimension calculations often involve 
pages of complicated manipulations and estimates that provide little intuitive 
enlightenment. 

In this chapter we bring together some of the basic techniques that are available 
for dimension calculations. Other methods, that are applicable in more specific 
cases, will be found throughout the book. 



4.1 Basic methods 

As a general rule, we get upper bounds for Hausdorff measures and dimensions by 
finding effective coverings by small sets, and lower bounds by putting measures 
or mass distributions on the set. For most fractals 'obvious' upper estimates of 
dimension may be obtained using natural coverings by small sets. 

Proposition 4.1 

Suppose F can be covered by $n_k$ sets of diameter at most $\delta_k$ with $\delta_k \to 0$ as
$k \to \infty$. Then

\[ \dim_H F \le \underline{\dim}_B F \le \underline{\lim}_{k \to \infty} \frac{\log n_k}{-\log \delta_k}. \]

Moreover, if $n_k \delta_k^s$ remains bounded as $k \to \infty$, then $\mathcal{H}^s(F) < \infty$. If $\delta_k \to 0$
but $\delta_{k+1} \ge c\delta_k$ for some $0 < c < 1$, then

\[ \overline{\dim}_B F \le \overline{\lim}_{k \to \infty} \frac{\log n_k}{-\log \delta_k}. \]

Proof. The inequalities for the box-counting dimension are immediate from the
definitions and the remark at (3.14). That $\dim_H F \le \underline{\dim}_B F$ is in (3.17), and if
$n_k \delta_k^s$ is bounded then $\mathcal{H}^s_{\delta_k}(F) \le n_k \delta_k^s$, so $\mathcal{H}^s_{\delta_k}(F)$ tends to a finite limit $\mathcal{H}^s(F)$ as
$k \to \infty$. □

Thus, as we have seen already (Example 2.7), in the case of the middle third
Cantor set the natural coverings by $2^k$ intervals of length $3^{-k}$ give
$\dim_H F \le \underline{\dim}_B F \le \overline{\dim}_B F \le \log 2/\log 3$.

Surprisingly often, the 'obvious' upper bound for the Hausdorff dimension 
of a set turns out to be the actual value. However, demonstrating this can be 
difficult. To obtain an upper bound it is enough to evaluate sums of the form
$\sum |U_i|^s$ for specific coverings $\{U_i\}$ of F, whereas for a lower bound we must
show that $\sum |U_i|^s$ is greater than some positive constant for all $\delta$-coverings of F.
Clearly an enormous number of such coverings are available. In particular, when
working with Hausdorff dimension as opposed to box dimension, consideration
must be given to covers where some of the $U_i$ are very small and others have
relatively large diameter; this prohibits sweeping estimates for $\sum |U_i|^s$ such as
those available for upper bounds.

One way of getting around these difficulties is to show that no individual set 
U can cover too much of F compared with its size measured as $|U|^s$. Then if
$\{U_i\}$ covers the whole of F the sum $\sum |U_i|^s$ cannot be too small. The usual way
to do this is to concentrate a suitable mass distribution $\mu$ on F and compare the
mass $\mu(U)$ covered by U with $|U|^s$ for each U. (Recall that a mass distribution
on F is a measure with support contained in F such that $0 < \mu(F) < \infty$, see
Section 1.3.)



Mass distribution principle 4.2 

Let $\mu$ be a mass distribution on F and suppose that for some s there are numbers
c > 0 and $\varepsilon > 0$ such that

\[ \mu(U) \le c|U|^s \tag{4.1} \]

for all sets U with $|U| \le \varepsilon$. Then $\mathcal{H}^s(F) \ge \mu(F)/c$ and

\[ s \le \dim_H F \le \underline{\dim}_B F \le \overline{\dim}_B F. \]

Proof. If $\{U_i\}$ is any cover of F then

\[ 0 < \mu(F) \le \mu\Big(\bigcup_i U_i\Big) \le \sum_i \mu(U_i) \le c \sum_i |U_i|^s \tag{4.2} \]

using properties of a measure and (4.1).

Taking infima, $\mathcal{H}^s_\delta(F) \ge \mu(F)/c$ if $\delta$ is small enough, so $\mathcal{H}^s(F) \ge \mu(F)/c$.
Since $\mu(F) > 0$ we get $\dim_H F \ge s$. □

Notice that the conclusion $\mathcal{H}^s(F) \ge \mu(F)/c$ remains true if $\mu$ is a mass
distribution on $\mathbb{R}^n$ and F is any subset.
The Mass distribution principle 4.2 gives a quick lower estimate for the Hausdorff
dimension of the middle third Cantor set F (figure 0.1). Let $\mu$ be the natural
mass distribution on F, so that each of the $2^k$ kth level intervals of length $3^{-k}$
in $E_k$ in the construction of F carries a mass $2^{-k}$. (We imagine that we start with
unit mass on $E_0$ and repeatedly divide the mass on each interval of $E_k$ between
its two subintervals in $E_{k+1}$; see Proposition 1.7.) Let U be a set with $|U| < 1$
and let k be the integer such that $3^{-(k+1)} \le |U| < 3^{-k}$. Then U can intersect at
most one of the intervals of $E_k$, so

\[ \mu(U) \le 2^{-k} = (3^{\log 2/\log 3})^{-k} = (3^{-k})^{\log 2/\log 3} \le (3|U|)^{\log 2/\log 3} \]

and hence $\mathcal{H}^{\log 2/\log 3}(F) \ge 3^{-\log 2/\log 3} = \frac{1}{2}$ by the mass distribution principle,
giving $\dim_H F \ge \log 2/\log 3$.
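
The inequality $\mu(U) \le (3|U|)^{\log 2/\log 3} = 2|U|^{\log 2/\log 3}$ can be spot-checked numerically for intervals U. The sketch below is illustrative only: the function names are hypothetical, the natural mass distribution is evaluated through its distribution function $x \mapsto \mu([0, x])$ (the Cantor function, computed here from a truncated ternary expansion in floating point), and only intervals with endpoints on a regular grid are tested.

```python
import math

S = math.log(2) / math.log(3)

def cantor_cdf(x, depth=40):
    """mu([0, x]) for the natural mass distribution on the middle third Cantor set."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    total, weight = 0.0, 0.5
    for _ in range(depth):
        x *= 3.0
        digit = min(int(x), 2)        # next ternary digit of x
        x -= digit
        if digit == 1:                # x lies in a removed gap: the function is flat there
            return total + weight
        if digit == 2:
            total += weight
        weight /= 2.0
    return total

def mass(a, b):
    """mu([a, b]); the distribution has no atoms, so endpoints do not matter."""
    return cantor_cdf(b) - cantor_cdf(a)

def worst_ratio(samples=200):
    """Largest value of mu(U) / |U|^S over grid intervals U = [i/samples, j/samples]."""
    worst = 0.0
    for i in range(samples):
        a = i / samples
        for j in range(i + 1, samples + 1):
            b = j / samples
            worst = max(worst, mass(a, b) / (b - a) ** S)
    return worst
```

The value returned by `worst_ratio()` should not exceed the constant c = 2 appearing in the estimate above.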

Example 4.3 

Let $F_1 = F \times [0, 1] \subset \mathbb{R}^2$ be the product of the middle third Cantor set F and the
unit interval. Then, setting $s = 1 + \log 2/\log 3$, we have $\dim_B F_1 = \dim_H F_1 = s$,
with $0 < \mathcal{H}^s(F_1) < \infty$.

Calculation. For each k, there is a covering of F by $2^k$ intervals of length $3^{-k}$. A
column of $3^k$ squares of side $3^{-k}$ (diameter $3^{-k}\sqrt{2}$) covers the part of $F_1$ above each
such interval, so taking these all together, $F_1$ may be covered by $2^k 3^k$ squares of
side $3^{-k}$. Thus

\[ \mathcal{H}^s_{3^{-k}\sqrt{2}}(F_1) \le 3^k 2^k (3^{-k}\sqrt{2})^s = (3 \cdot 2 \cdot 3^{-1-\log 2/\log 3})^k\, 2^{s/2} = 2^{s/2}, \]

so $\mathcal{H}^s(F_1) \le 2^{s/2}$ and $\dim_H F_1 \le \underline{\dim}_B F_1 \le \overline{\dim}_B F_1 \le s$.

We define a mass distribution $\mu$ on $F_1$ by taking the natural mass distribution
on F described above (each kth level interval of F of side $3^{-k}$ having mass
$2^{-k}$) and 'spreading it' uniformly along the intervals above F. Thus if U is a
rectangle, with sides parallel to the coordinate axes, of height $h \le 1$, above a kth
level interval of F, then $\mu(U) = h2^{-k}$. Any set U is contained in a square of
side $|U|$ with sides parallel to the coordinate axes. If $3^{-(k+1)} \le |U| < 3^{-k}$ then
U lies above at most one kth level interval of F of side $3^{-k}$, so

\[ \mu(U) \le |U| 2^{-k} = |U| 3^{-k \log 2/\log 3} \le |U| (3|U|)^{\log 2/\log 3} = 3^{\log 2/\log 3} |U|^s = 2|U|^s. \]

By the Mass distribution principle 4.2, $\mathcal{H}^s(F_1) \ge \frac{1}{2}$. □

Note that the method of Examples 4.2 and 4.3 extends to a wide variety of 
self-similar sets. Indeed, Theorem 9.3 may be regarded as a generalization of this 
calculation. 

Notice that in this example the dimension of the product of two sets equals the 
sum of the dimensions of the sets. We study this in greater depth in Chapter 7. 

The following general construction of a subset of $\mathbb{R}$ may be thought of as a
generalization of the Cantor set construction. Let $[0, 1] = E_0 \supset E_1 \supset E_2 \supset \cdots$
be a decreasing sequence of sets, with each $E_k$ a union of a finite number of
disjoint closed intervals (called kth level basic intervals), with each interval of
$E_k$ containing at least two intervals of $E_{k+1}$, and the maximum length of kth
level intervals tending to 0 as $k \to \infty$. Then the set

\[ F = \bigcap_{k=0}^{\infty} E_k \tag{4.3} \]

is a totally disconnected subset of [0, 1] which is generally a fractal (figure 4.1).
Obvious upper bounds for the dimension of F are available by taking the
intervals of $E_k$ as covering intervals, for each k, but, as usual, lower bounds
are harder to find. Note that, in the following examples, the upper estimates for 
dim H F depend on the number and size of the basic intervals, whilst the lower 
estimates depend on their spacing. For these to be equal, the (k + l)th level 
intervals must be 'nearly uniformly distributed' inside the kth level intervals. 

Example 4.4 

Let s be a number strictly between 0 and 1. Assume that in the general construction
(4.3) for each kth level interval I, the (k + 1)th level intervals $I_1, \ldots, I_m$
($m \ge 2$) contained in I are of equal length and equally spaced, the lengths being
given by

\[ |I_i|^s = \frac{1}{m}|I|^s \quad (1 \le i \le m) \tag{4.4} \]

with the left-hand ends of $I_1$ and I coinciding, and the right-hand ends of $I_m$
and I coinciding. Then $\dim_H F = s$ and $0 < \mathcal{H}^s(F) < \infty$. (Notice that m may be
different for different intervals I in the construction, so that the kth level intervals
may have widely differing lengths.)

Calculation. With $I$, $I_i$ as above,

$$|I|^s = \sum_{i=1}^{m} |I_i|^s. \qquad (4.5)$$

Applying this inductively to the kth level intervals for successive k, we have,
for each k, that $1 = \sum_i |I_i|^s$, where the sum is over all the kth level intervals $I_i$.
The kth level intervals cover F; since the maximum interval length tends to 0 as
$k \to \infty$, we have $\mathcal{H}^s_\delta(F) \leq 1$ for sufficiently small $\delta$, giving $\mathcal{H}^s(F) \leq 1$.

Figure 4.1 An example of the general construction of a subset of $\mathbb{R}$

Now distribute a mass $\mu$ on F in such a way that $\mu(I) = |I|^s$ whenever I
is any level k interval. Thus, starting with unit mass on [0, 1] we divide this
equally between each level 1 interval, the mass on each of these intervals being
divided equally between each level 2 subinterval, and so on; see Proposition 1.7.
Equation (4.5) ensures that we get a mass distribution on F with $\mu(I) = |I|^s$
for every basic interval. We estimate $\mu(U)$ for an interval U with endpoints in
F. Let I be the smallest basic interval that contains U; suppose that I is a kth
level interval, and let $I_1, \ldots, I_m$ be the (k + 1)th level intervals contained in I.
Then U intersects a number $j \geq 2$ of the $I_i$, otherwise U would be contained in
a smaller basic interval. The spacing between consecutive $I_i$ is

$$\frac{|I| - m|I_i|}{m-1} = \frac{|I|\,(1 - m|I_i|/|I|)}{m-1} = \frac{|I|\,(1 - m^{1-1/s})}{m-1} \geq \frac{c_s|I|}{m}$$

using (4.4) and that $m \geq 2$ and $0 < s < 1$, where $c_s = 1 - 2^{1-1/s}$. Thus

$$|U| \geq \frac{j-1}{m}\,c_s|I| \geq \frac{j}{2m}\,c_s|I|.$$

By (4.4)

$$\mu(U) \leq j\mu(I_i) = j|I_i|^s = \frac{j}{m}|I|^s \leq 2^s c_s^{-s}\left(\frac{j}{m}\right)^{1-s}|U|^s \leq 2^s c_s^{-s}|U|^s. \qquad (4.6)$$

This is true for any interval U with endpoints in F, and so for any set U (by
applying (4.6) to the smallest interval containing $U \cap F$). By the Mass distribution
principle 4.2, $\mathcal{H}^s(F) > 0$. □

A more careful estimate of $\mu(U)$ in Example 4.4 leads to $\mathcal{H}^s(F) = 1$.
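The two facts that drive the calculation in Example 4.4, namely that the sum of $|I|^s$ over the kth level intervals stays equal to 1 and that consecutive subintervals are at least $c_s|I|/m$ apart, can be checked numerically. The following is only an illustrative sketch: the function names, the random choice of m from {2, 3, 4} and the value s = 0.6 are assumptions made for the example, not part of the text.

```python
import random

def subdivide(interval, m, s):
    """Split an interval into m equal, equally spaced subintervals with the
    outer ends aligned, whose lengths satisfy (4.4): |I_i|^s = |I|^s / m."""
    a, b = interval
    length = b - a
    sub_len = length * m ** (-1.0 / s)        # solves |I_i|^s = |I|^s / m
    gap = (length - m * sub_len) / (m - 1)    # spacing between consecutive I_i
    children = [(a + i * (sub_len + gap), a + i * (sub_len + gap) + sub_len)
                for i in range(m)]
    return children, gap

def build_levels(s, depth, seed=0):
    """Build `depth` levels of the construction, letting m vary from interval
    to interval. Returns the levels and whether every spacing was at least
    c_s |I| / m, as claimed in the calculation."""
    rng = random.Random(seed)
    c_s = 1 - 2 ** (1 - 1 / s)
    levels = [[(0.0, 1.0)]]
    spacing_ok = True
    for _ in range(depth):
        next_level = []
        for interval in levels[-1]:
            m = rng.choice([2, 3, 4])         # hypothetical choice of m per interval
            children, gap = subdivide(interval, m, s)
            length = interval[1] - interval[0]
            spacing_ok = spacing_ok and gap >= c_s * length / m
            next_level.extend(children)
        levels.append(next_level)
    return levels, spacing_ok

def level_sum(level, s):
    """Sum of |I|^s over the intervals of one level; (4.5) predicts 1."""
    return sum((b - a) ** s for a, b in level)
```

Calling `build_levels(0.6, 5)` and then `level_sum` on each level should give values equal to 1 up to rounding error, which is the covering estimate behind $\mathcal{H}^s(F) \leq 1$.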

We call the sets obtained when m is kept constant throughout the construction 
of Example 4.4 uniform Cantor sets; see figure 4.2. These provide a natural 
generalization of the middle third Cantor set. 

Example 4.5. Uniform Cantor sets 

Let $m \geq 2$ be an integer and $0 < r < 1/m$. Let F be the set obtained by the
construction in which each basic interval I is replaced by m equally spaced
subintervals of lengths $r|I|$, the ends of I coinciding with the ends of the extreme
subintervals. Then $\dim_H F = \dim_B F = \log m/-\log r$, and $0 < \mathcal{H}^{\log m/-\log r}(F) < \infty$.

Figure 4.2 A uniform Cantor set (Example 4.5) with m = 3, r = 4/15, $\dim_H F = \dim_B F = \log 3/-\log\frac{4}{15} = 0.831\ldots$

Calculation. The set F is obtained on taking m constant and $s = \log m/(-\log r)$
in Example 4.4. Equation (4.4) becomes $(r|I|)^s = (1/m)|I|^s$, which is satisfied
identically, so $\dim_H F = s$. For the box dimension, note that F is covered by the
$m^k$ kth level intervals of length $r^k$ for each k, leading to $\overline{\dim}_B F \leq \log m/-\log r$
in the usual way. □

The middle $\lambda$ Cantor set is obtained by repeatedly removing a proportion
$0 < \lambda < 1$ from the middle of intervals, starting with the unit interval. This is a
special case of a uniform Cantor set, having m = 2 and $r = \frac{1}{2}(1 - \lambda)$ and thus
Hausdorff and box dimensions $\log 2/\log(2/(1 - \lambda))$.
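As a quick numerical companion to Example 4.5, the sketch below evaluates the dimension formula and counts the mesh intervals of side $r^k$ that contain a left endpoint of a kth level interval. The function names are illustrative assumptions, and the box count is a crude check of the covering argument, not a general box-counting routine.

```python
import math

def uniform_cantor_dimension(m, r):
    """log m / -log r, the common Hausdorff and box dimension in Example 4.5."""
    if m < 2 or not 0 < r < 1 / m:
        raise ValueError("need an integer m >= 2 and 0 < r < 1/m")
    return math.log(m) / -math.log(r)

def middle_lambda_dimension(lam):
    """Dimension of the middle-lambda Cantor set: m = 2, r = (1 - lambda)/2."""
    return uniform_cantor_dimension(2, (1 - lam) / 2)

def level_intervals(m, r, k):
    """Left endpoints of the m**k kth level intervals, and their common length."""
    lefts, length = [0.0], 1.0
    for _ in range(k):
        sub = r * length
        gap = (length - m * sub) / (m - 1)    # equal spacing, outer ends aligned
        lefts = [a + i * (sub + gap) for a in lefts for i in range(m)]
        length = sub
    return lefts, length

def box_count(m, r, k):
    """Number of mesh intervals of side delta = r**k containing a left endpoint."""
    lefts, delta = level_intervals(m, r, k)
    return len({math.floor(a / delta) for a in lefts}), delta

def box_dimension_estimate(m, r, k):
    n, delta = box_count(m, r, k)
    return math.log(n) / -math.log(delta)
```

With m = 3 and r = 4/15, the parameters of figure 4.2, `uniform_cantor_dimension` reproduces the value 0.831... quoted in the caption, and `middle_lambda_dimension(1/3)` gives log 2/log 3.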

The next example is another case of the general construction. 



Example 4.6 

Suppose in the general construction (4.3) each (k − 1)th level interval contains
at least $m_k \geq 2$ kth level intervals (k = 1, 2, ...) which are separated by gaps of
at least $\varepsilon_k$, where $0 < \varepsilon_{k+1} < \varepsilon_k$ for each k. Then

$$\dim_H F \geq \liminf_{k \to \infty} \frac{\log(m_1 \cdots m_{k-1})}{-\log(m_k \varepsilon_k)}. \qquad (4.7)$$

Calculation. We may assume that the right hand side of (4.7) is positive, otherwise
(4.7) is obvious. We may assume that each (k − 1)th level interval contains
exactly $m_k$ kth level intervals; if not we may throw out excess intervals to get
smaller sets $E_k$ and F for which this is so. We may define a mass distribution
$\mu$ on F by assigning a mass of $(m_1 \cdots m_k)^{-1}$ to each of the $m_1 \cdots m_k$ kth
level intervals.

Let U be an interval with $0 < |U| < \varepsilon_1$; we estimate $\mu(U)$. Let k be the integer
such that $\varepsilon_k \leq |U| < \varepsilon_{k-1}$. The number of kth level intervals that intersect U is

(i) at most $m_k$ since U intersects at most one (k − 1)th level interval
(ii) at most $(|U|/\varepsilon_k) + 1 \leq 2|U|/\varepsilon_k$ since the kth level intervals have gaps of
at least $\varepsilon_k$ between them.

Each kth level interval supports mass $(m_1 \cdots m_k)^{-1}$ so that

$$\mu(U) \leq (m_1 \cdots m_k)^{-1}\min\{2|U|/\varepsilon_k,\ m_k\} \leq (m_1 \cdots m_k)^{-1}(2|U|/\varepsilon_k)^s m_k^{1-s}$$

for every $0 \leq s \leq 1$. Hence

$$\frac{\mu(U)}{|U|^s} \leq \frac{2^s}{(m_1 \cdots m_{k-1})\,m_k^s\,\varepsilon_k^s}.$$

If

$$s < \liminf_{k \to \infty} \frac{\log(m_1 \cdots m_{k-1})}{-\log(m_k \varepsilon_k)}$$

then $(m_1 \cdots m_{k-1})\,m_k^s\,\varepsilon_k^s > 1$ for large k, so $\mu(U) \leq 2^s|U|^s$, and $\dim_H F \geq s$ by
Principle 4.2, giving (4.7). □

Now suppose that in Example 4.6 the kth level intervals are all of length $\delta_k$,
and that each (k − 1)th level interval contains exactly $m_k$ kth level intervals,
which are 'roughly equally spaced' in the sense that $m_k \varepsilon_k \geq c\delta_{k-1}$, where c > 0
is a constant. Then (4.7) becomes

$$\dim_H F \geq \liminf_{k \to \infty} \frac{\log(m_1 \cdots m_{k-1})}{-\log c - \log \delta_{k-1}} = \liminf_{k \to \infty} \frac{\log(m_1 \cdots m_{k-1})}{-\log \delta_{k-1}}.$$

But $E_{k-1}$ comprises $m_1 \cdots m_{k-1}$ intervals of length $\delta_{k-1}$, so this expression equals
the upper bound for $\dim_H F$ given by Proposition 4.1. Thus in the situation where
the intervals are well spaced, we get equality in (4.7).
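To see how the ratio in (4.7) behaves in a familiar case, the sketch below evaluates it for the middle third Cantor set, taking $m_k = 2$ and gaps $\varepsilon_k = 3^{-k}$. These parameter choices and the function name are assumptions made for illustration; the ratio approaches log 2/log 3 from below as k grows.

```python
import math

def lower_bound_ratio(ms, eps, k):
    """The kth term log(m_1 ... m_{k-1}) / -log(m_k eps_k) of (4.7).

    ms[j - 1] holds m_j and eps[j - 1] holds eps_j (1-based indices as in the text).
    """
    numerator = sum(math.log(m) for m in ms[:k - 1])
    denominator = -math.log(ms[k - 1] * eps[k - 1])
    return numerator / denominator

# Middle third Cantor set: two subintervals per interval, gaps at least 3**-k.
K = 200
ms = [2] * K
eps = [3.0 ** -k for k in range(1, K + 1)]
estimate = lower_bound_ratio(ms, eps, K)
```

Here the kth level intervals have length $\delta_k = 3^{-k}$ and $m_k\varepsilon_k = 2 \cdot 3^{-k} \geq \frac{2}{3}\delta_{k-1}$, so this is an instance of the 'roughly equally spaced' situation in which (4.7) holds with equality.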

Examples of the following form occur in number theory; see Section 10.3. 



Example 4.7 

Fix $0 < s < 1$ and let $n_0, n_1, n_2, \ldots$ be a rapidly increasing sequence of integers,
say $n_{k+1} \geq \max\{n_k^k,\ 4n_k^{1/s}\}$ for each k. For each k let $H_k \subset \mathbb{R}$ consist of equally
spaced equal intervals of lengths $n_k^{-1/s}$ with the midpoints of consecutive intervals
distance $n_k^{-1}$ apart. Then writing $F = \bigcap_{k=1}^{\infty} H_k$, we have $\dim_H F = s$.

Calculation. Since $F \subset H_k$ for each k, the set $F \cap [0, 1]$ is contained in at most
$n_k + 1$ intervals of length $n_k^{-1/s}$, so Proposition 4.1 gives
$\dim_H(F \cap [0, 1]) \leq \liminf_{k \to \infty} \log(n_k + 1)/-\log n_k^{-1/s} = s$. Similarly,
$\dim_H(F \cap [n, n + 1]) \leq s$ for all $n \in \mathbb{Z}$, so $\dim_H F \leq s$ as a countable union of such sets.

Now let $E_0 = [0, 1]$ and, for $k \geq 1$, let $E_k$ consist of the intervals of $H_k$ that
are completely contained in $E_{k-1}$. Then each interval I of $E_{k-1}$ contains at least
$n_k|I| - 2 \geq n_k n_{k-1}^{-1/s} - 2 \geq 2$ intervals of $E_k$, which are separated by gaps of at
least $n_k^{-1} - n_k^{-1/s} \geq \frac{1}{2}n_k^{-1}$ if k is large enough. Using Example 4.6, and noting
that setting $m_k = n_k n_{k-1}^{-1/s}$ rather than $m_k = n_k n_{k-1}^{-1/s} - 2$ does not affect the limit,

$$\dim_H(F \cap [0,1]) \geq \dim_H \bigcap_{k=1}^{\infty} E_k \geq \liminf_{k \to \infty} \frac{\log\left((n_1 \cdots n_{k-2})^{1-1/s}\, n_{k-1}\right)}{-\log\left(n_k n_{k-1}^{-1/s}\, \frac{1}{2} n_k^{-1}\right)}$$

$$= \liminf_{k \to \infty} \frac{\log (n_1 \cdots n_{k-2})^{1-1/s} + \log n_{k-1}}{\log 2 + (\log n_{k-1})/s}.$$

Provided that $n_k$ is sufficiently rapidly increasing, the terms in $\log n_{k-1}$ in the
numerator and denominator of this expression are dominant, so that $\dim_H F \geq
\dim_H(F \cap [0, 1]) \geq s$, as required. □

Although the Mass distribution principle 4.2 is based on a simple idea, we 
have seen that it can be very useful in finding Hausdorff and box dimensions. 
We now develop some important variations of the method. 

It is enough for condition (4.1) to hold just for sufficiently small balls centred
at each point of F. This is expressed in Proposition 4.9(a). Although mass
distribution methods for upper bounds are required far less frequently, we include
part (b) because it is, in a sense, dual to (a). Note that density expressions, such
as $\limsup_{r \to 0} \mu(B(x, r))/r^s$, play a major role in the study of local properties of
fractals; see Chapter 5. (Recall that B(x, r) is the closed ball of centre x and
radius r.)

We require the following covering lemma in the proof of Proposition 4.9(b).



Covering lemma 4.8 

Let $\mathcal{C}$ be a family of balls contained in some bounded region of $\mathbb{R}^n$. Then there
is a (finite or countable) disjoint subcollection $\{B_i\}$ such that

$$\bigcup_{B \in \mathcal{C}} B \subset \bigcup_i \tilde{B}_i \qquad (4.8)$$

where $\tilde{B}_i$ is the closed ball concentric with $B_i$ and of four times the radius.

Proof. For simplicity, we give the proof when $\mathcal{C}$ is a finite family; the basic idea
is the same in the general case. We select the $\{B_i\}$ inductively. Let $B_1$ be a
ball in $\mathcal{C}$ of maximum radius. Suppose that $B_1, \ldots, B_{k-1}$ have been chosen. We
take $B_k$ to be the largest ball in $\mathcal{C}$ (or one of the largest) that does not intersect
$B_1, \ldots, B_{k-1}$. The process terminates when no such ball remains. Clearly the
balls selected are disjoint; we must check that (4.8) holds. If $B \in \mathcal{C}$, then either
$B = B_i$ for some i, or B intersects one of the $B_i$ with $|B_i| \geq |B|$; if this were
not the case, then B would have been chosen instead of the first ball $B_k$ with
$|B_k| < |B|$. Either way, $B \subset \tilde{B}_i$, so we have (4.8). (It is easy to see that the
result remains true taking $\tilde{B}_i$ as the ball concentric with $B_i$ and of $3 + \varepsilon$ times
the radius, for any $\varepsilon > 0$; if $\mathcal{C}$ is finite we may take $\varepsilon = 0$.) □
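The inductive selection in the proof is a greedy algorithm, and for a finite family it can be written down directly. The sketch below is an illustration under stated assumptions: balls are represented as hypothetical `(centre, radius)` pairs with the centre a tuple of coordinates, and the function names are invented for the example. It checks containment in the enlargement by a factor 3, which the closing remark of the proof says is enough for finite families.

```python
import math

def intersects(ball_a, ball_b):
    """True if two closed balls, given as (centre, radius), have a common point."""
    (centre_a, radius_a), (centre_b, radius_b) = ball_a, ball_b
    return math.dist(centre_a, centre_b) <= radius_a + radius_b

def greedy_disjoint_subfamily(balls):
    """Select balls as in the proof: repeatedly take a largest ball that does
    not intersect any ball already chosen."""
    chosen = []
    for ball in sorted(balls, key=lambda b: b[1], reverse=True):
        if not any(intersects(ball, other) for other in chosen):
            chosen.append(ball)
    return chosen

def covered_by_enlargements(ball, chosen, factor=3.0, tol=1e-12):
    """True if `ball` lies inside some chosen ball enlarged by `factor`."""
    centre, radius = ball
    return any(math.dist(centre, c) + radius <= factor * r + tol
               for c, r in chosen)
```

Processing the balls in order of decreasing radius is equivalent to the selection rule of the proof, because a ball rejected at one stage meets a ball already chosen and so remains ineligible at every later stage.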

Proposition 4.9 

Let $\mu$ be a mass distribution on $\mathbb{R}^n$, let $F \subset \mathbb{R}^n$ be a Borel set and let $0 < c < \infty$
be a constant.

(a) If $\limsup_{r \to 0} \mu(B(x, r))/r^s < c$ for all $x \in F$ then $\mathcal{H}^s(F) \geq \mu(F)/c$

(b) If $\limsup_{r \to 0} \mu(B(x, r))/r^s > c$ for all $x \in F$ then $\mathcal{H}^s(F) \leq 2^s\mu(\mathbb{R}^n)/c$.

Proof 

(a) For each $\delta > 0$ let

$$F_\delta = \{x \in F : \mu(B(x, r)) < cr^s \text{ for all } 0 < r \leq \delta\}.$$

Let $\{U_i\}$ be a $\delta$-cover of F and thus of $F_\delta$. For each $U_i$ containing a point
x of $F_\delta$, the ball B with centre x and radius $|U_i|$ certainly contains $U_i$. By
definition of $F_\delta$,

$$\mu(U_i) \leq \mu(B) < c|U_i|^s$$

so that

$$\mu(F_\delta) \leq \sum_i \{\mu(U_i) : U_i \text{ intersects } F_\delta\} \leq c\sum_i |U_i|^s.$$

Since $\{U_i\}$ is any $\delta$-cover of F, it follows that $\mu(F_\delta) \leq c\mathcal{H}^s_\delta(F) \leq c\mathcal{H}^s(F)$.
But $F_\delta$ increases to F as $\delta$ decreases to 0, so $\mu(F) \leq c\mathcal{H}^s(F)$ by (1.7).

(b) For simplicity, we prove a weaker version of (b) with $2^s$ replaced by $8^s$,
but the basic idea is similar. Suppose first that F is bounded. Fix $\delta > 0$
and let $\mathcal{C}$ be the collection of balls

$$\{B(x, r) : x \in F,\ 0 < r \leq \delta \text{ and } \mu(B(x, r)) > cr^s\}.$$

Then by the hypothesis of (b) $F \subset \bigcup_{B \in \mathcal{C}} B$. Applying the Covering
lemma 4.8 to the collection $\mathcal{C}$, there is a sequence of disjoint balls $B_i \in \mathcal{C}$
such that $\bigcup_{B \in \mathcal{C}} B \subset \bigcup_i \tilde{B}_i$ where $\tilde{B}_i$ is the ball concentric with $B_i$ but of
four times the radius. Thus $\{\tilde{B}_i\}$ is an $8\delta$-cover of F, so

$$\mathcal{H}^s_{8\delta}(F) \leq \sum_i |\tilde{B}_i|^s \leq 4^s \sum_i |B_i|^s \leq 8^s c^{-1} \sum_i \mu(B_i) \leq 8^s c^{-1}\mu(\mathbb{R}^n).$$

Letting $\delta \to 0$, we get $\mathcal{H}^s(F) \leq 8^s c^{-1}\mu(\mathbb{R}^n) < \infty$. Finally, if F is
unbounded and $\mathcal{H}^s(F) > 8^s c^{-1}\mu(\mathbb{R}^n)$, the $\mathcal{H}^s$-measure of some bounded
subset of F will also exceed this value, contrary to the above. □

Note that it is immediate from Proposition 4.9 that if $\lim_{r \to 0} \log \mu(B(x, r))/\log r = s$
for all $x \in F$ then $\dim_H F = s$.
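The ratio $\log \mu(B(x, r))/\log r$ can be evaluated exactly for the natural mass distribution on the middle third Cantor set, since $\mu([0, x])$ is the Cantor function. The sketch below is illustrative only: the function names, the truncation depth and the sample point x = 1/4 (a point of the Cantor set) are assumptions for the example. For $r = 3^{-k}$ the ball B(x, r) contains one kth level interval and meets at most two, so the ratio lies within $\log 2/(k \log 3)$ of log 2/log 3.

```python
from fractions import Fraction
import math

def cantor_cdf(x, depth=60):
    """mu([0, x]) for the natural mass distribution on the middle third
    Cantor set, i.e. the Cantor function, computed to within 2**-depth."""
    x = Fraction(x)
    if x <= 0:
        return Fraction(0)
    if x >= 1:
        return Fraction(1)
    total, weight = Fraction(0), Fraction(1, 2)
    for _ in range(depth):
        if x < Fraction(1, 3):
            x = 3 * x                  # rescale the left third to [0, 1]
        elif x > Fraction(2, 3):
            total += weight            # all mass of the left third lies below x
            x = 3 * x - 2              # rescale the right third to [0, 1]
        else:
            return total + weight      # x lies in the removed middle third (closed)
        weight /= 2
    return total

def ball_mass(x, r):
    """mu(B(x, r)); mu has no atoms, so closed and open balls agree."""
    return cantor_cdf(Fraction(x) + r) - cantor_cdf(Fraction(x) - r)

def local_dimension_estimate(x, k):
    """log mu(B(x, r)) / log r at radius r = 3**-k."""
    r = Fraction(1, 3 ** k)
    return math.log(ball_mass(x, r)) / math.log(r)
```

For example, `local_dimension_estimate(Fraction(1, 4), 20)` should be close to log 2/log 3 = 0.6309..., in line with the remark following Proposition 4.9.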

Applications of Proposition 4.9 will occur throughout the book. 

We conclude this section by a reminder that these calculations can be used
in conjunction with the basic properties of dimensions discussed in Chapters 2
and 3. For example, since $f(x) = x^2$ is Lipschitz on [0, 1] and bi-Lipschitz on
$[\frac{2}{3}, 1]$, it follows that $\dim_H\{x^2 : x \in C\} = \dim_H f(C) = \log 2/\log 3$, where C is
the middle third Cantor set.



4.2 Subsets of finite measure 

This section may seem out of place in a chapter about finding dimensions. How- 
ever, Theorem 4.10 is required for the important potential theoretic methods 
developed in the following section. Sets of infinite measure can be awkward to 
work with, and reducing them to sets of positive finite measure can be a very 
useful simplification. 

Theorem 4.10 guarantees that any (Borel) set F with $\mathcal{H}^s(F) = \infty$ contains a
subset E with $0 < \mathcal{H}^s(E) < \infty$ (i.e. with E an s-set). At first, this might seem
obvious: just shave pieces off F until what remains has positive finite measure.
Unfortunately it is not quite this simple, since it is possible to jump from infinite
measure to zero measure without passing through any intermediate value. Stating
this in mathematical terms, it is possible to have a decreasing sequence of sets
$E_1 \supset E_2 \supset \cdots$ with $\mathcal{H}^s(E_k) = \infty$ for all k, but with $\mathcal{H}^s(\bigcap_{k=1}^{\infty} E_k) = 0$. (For a
simple example, take $E_k = [0, 1/k] \subset \mathbb{R}$ and $0 < s < 1$.) To prove the theorem
we need to look rather more closely at the structure of Hausdorff measures.
Readers mainly concerned with applications may prefer to omit the proof!

Theorem 4.10 

Let F be a Borel subset of $\mathbb{R}^n$ with $0 < \mathcal{H}^s(F) \leq \infty$. Then there is a compact
set $E \subset F$ such that $0 < \mathcal{H}^s(E) < \infty$.

* Sketch of proof. The complete proof of this is complicated. We indicate the ideas
involved in the case where F is a compact subset of $[0, 1) \subset \mathbb{R}$ and $0 < s < 1$.
We work with the net measures $\mathcal{M}^s$ which are defined in (2.17)-(2.18) using
the binary intervals $[r2^{-k}, (r + 1)2^{-k})$ and are related to Hausdorff measure by
(2.19). We define inductively a decreasing sequence $E_0 \supset E_1 \supset E_2 \supset \cdots$ of
compact subsets of F. Let $E_0 = F$. For $k \geq 0$ we define $E_{k+1}$ by specifying its
intersection with each binary interval I of length $2^{-k}$. If $\mathcal{M}^s_{2^{-(k+1)}}(E_k \cap I) \leq 2^{-sk}$
we let $E_{k+1} \cap I = E_k \cap I$. Then

$$\mathcal{M}^s_{2^{-(k+1)}}(E_{k+1} \cap I) = \mathcal{M}^s_{2^{-k}}(E_k \cap I) \qquad (4.9)$$

since using I itself as a covering interval in calculating $\mathcal{M}^s_{2^{-k}}$ gives an estimate
at least as large as using shorter binary intervals. On the other hand, if
$\mathcal{M}^s_{2^{-(k+1)}}(E_k \cap I) > 2^{-sk}$ we take $E_{k+1} \cap I$ to be a compact subset of $E_k \cap I$
with $\mathcal{M}^s_{2^{-(k+1)}}(E_{k+1} \cap I) = 2^{-sk}$. Such a subset exists since
$\mathcal{M}^s_{2^{-(k+1)}}(E_k \cap I \cap [0, u])$ is finite and continuous in u. (This is why we need to
work with the $\mathcal{M}^s_\delta$ rather than $\mathcal{M}^s$.) Since $\mathcal{M}^s_{2^{-k}}(E_k \cap I) = 2^{-sk}$, (4.9) again
holds. Summing (4.9) over all binary intervals of length $2^{-k}$ we get

$$\mathcal{M}^s_{2^{-(k+1)}}(E_{k+1}) = \mathcal{M}^s_{2^{-k}}(E_k). \qquad (4.10)$$

Repeated application of (4.10) gives $\mathcal{M}^s_{2^{-k}}(E_k) = \mathcal{M}^s_1(E_0)$ for all k. Let E be
the compact set $\bigcap_{k=0}^{\infty} E_k$. Taking the limit as $k \to \infty$ gives $\mathcal{M}^s(E) = \mathcal{M}^s_1(E_0)$
(this step needs some justification). The covering of $E_0 = F$ by the single
interval [0, 1) gives $\mathcal{M}^s(E) = \mathcal{M}^s_1(E_0) \leq 1$. Since $\mathcal{M}^s(E_0) \geq \mathcal{H}^s(E_0) > 0$ we
have $\mathcal{M}^s_{2^{-k}}(E_0) > 0$ if k is large enough. Thus either $\mathcal{M}^s(E) = \mathcal{M}^s_1(E_0) \geq 2^{-ks}$,
or $\mathcal{M}^s_1(E_0) < 2^{-ks}$ in which case $\mathcal{M}^s(E) = \mathcal{M}^s_1(E_0) = \mathcal{M}^s_{2^{-k}}(E_0) > 0$. Thus
$0 < \mathcal{M}^s(E) < \infty$, and the theorem follows from (2.19). □

A number of results, for example those in Chapter 5, apply only to s-sets,
i.e. sets with $0 < \mathcal{H}^s(F) < \infty$. One way of approaching s-dimensional sets with
$\mathcal{H}^s(F) = \infty$ is to use Theorem 4.10 to extract a subset of positive finite measure,
to study its properties as an s-set, and then to interpret these properties in the
context of the larger set F. Similarly, if $0 < s < t$, any set F of Hausdorff
dimension t has $\mathcal{H}^s(F) = \infty$ and so contains an s-set.

The following proposition which follows from Proposition 4.9, leads to an 
extension of Theorem 4.10. 

Proposition 4.11 

Let F be a Borel set satisfying $0 < \mathcal{H}^s(F) < \infty$. There is a constant b and a
compact set $E \subset F$ with $\mathcal{H}^s(E) > 0$ such that

$$\mathcal{H}^s(E \cap B(x, r)) \leq br^s \qquad (4.11)$$

for all $x \in \mathbb{R}^n$ and r > 0.

Proof. In Proposition 4.9(b) take $\mu$ as the restriction of $\mathcal{H}^s$ to F, i.e. $\mu(A) =
\mathcal{H}^s(F \cap A)$. Then, if

$$F_1 = \left\{x \in \mathbb{R}^n : \limsup_{r \to 0} \mathcal{H}^s(F \cap B(x, r))/r^s > 2^{1+s}\right\}$$

it follows that $\mathcal{H}^s(F_1) \leq 2^s 2^{-(1+s)}\mu(F) = \frac{1}{2}\mathcal{H}^s(F)$. Thus $\mathcal{H}^s(F \setminus F_1) \geq \frac{1}{2}\mathcal{H}^s(F)
> 0$, so if $E_1 = F \setminus F_1$ then $\mathcal{H}^s(E_1) > 0$ and $\limsup_{r \to 0} \mathcal{H}^s(F \cap B(x, r))/r^s \leq 2^{1+s}$ for
$x \in E_1$. By Egoroff's theorem (see also Section 1.3) it follows that there is a
compact set $E \subset E_1$ with $\mathcal{H}^s(E) > 0$ and a number $r_0 > 0$ such that
$\mathcal{H}^s(F \cap B(x, r))/r^s \leq 2^{2+s}$ for all $x \in E$ and all $0 < r \leq r_0$. However, we
have that $\mathcal{H}^s(F \cap B(x, r))/r^s \leq \mathcal{H}^s(F)/r_0^s$ if $r \geq r_0$ so (4.11) holds for all
r > 0. □

Corollary 4.12 

Let F be a Borel subset of $\mathbb{R}^n$ with $0 < \mathcal{H}^s(F) \leq \infty$. Then there is a compact
set $E \subset F$ such that $0 < \mathcal{H}^s(E) < \infty$ and a constant b such that

$$\mathcal{H}^s(E \cap B(x, r)) \leq br^s$$

for all $x \in \mathbb{R}^n$ and r > 0.

Proof. Theorem 4.10 provides us with a compact subset $F_1$ of F of positive finite
measure, and applying Proposition 4.11 to $F_1$ gives the result. □

Corollary 4.12, which may be regarded as a converse of the Mass distribution 
principle 4.2, is often called 'Frostman's lemma'. 

4.3 Potential theoretic methods 

In this section we introduce a technique for calculating Hausdorff dimensions 
that is widely used both in theory and in practice. This replaces the need for 
estimating the mass of a large number of small sets, as in the Mass distribution 
principle, by a single check for the convergence of a certain integral. 

The ideas of potential and energy will be familiar to readers with a knowledge
of gravitation or electrostatics. For $s \geq 0$ the s-potential at a point x of $\mathbb{R}^n$ due
to the mass distribution $\mu$ on $\mathbb{R}^n$ is defined as

$$\phi_s(x) = \int \frac{d\mu(y)}{|x - y|^s}. \qquad (4.12)$$

(If we are working in $\mathbb{R}^3$ and s = 1 then this is essentially the familiar Newtonian
gravitational potential.) The s-energy of $\mu$ is

$$I_s(\mu) = \int \phi_s(x)\,d\mu(x) = \iint \frac{d\mu(x)\,d\mu(y)}{|x - y|^s}. \qquad (4.13)$$

The following theorem relates Hausdorff dimension to seemingly unconnected
potential theoretic ideas. Particularly useful is part (a): if there is a mass
distribution on a set F which has finite s-energy, then F has dimension at least s.
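A rough numerical feel for the s-energy can be had by replacing the natural mass distribution on the middle third Cantor set with $2^k$ point masses at the left endpoints of the kth level intervals and summing $|x - y|^{-s}$ over distinct pairs. This discretization, the function names and the sample values of s are assumptions made for illustration; it is a sketch of the behaviour, not a proof. By self-similarity the discrete energies $I^{(k)}$ satisfy $I^{(k)} = (3^s/2)I^{(k-1)} + (\text{cross term})$ with the cross term between $\frac{1}{2}$ and $3^s/2$, so they stay bounded when $s < \log 2/\log 3$ and grow geometrically when $s > \log 2/\log 3$.

```python
def cantor_left_endpoints(k):
    """Left endpoints of the 2**k kth level intervals of the middle third Cantor set."""
    points = [0.0]
    for _ in range(k):
        points = [p / 3 for p in points] + [2 / 3 + p / 3 for p in points]
    return points

def discrete_energy(s, k):
    """Sum over distinct pairs of mass_x * mass_y / |x - y|**s, where each of
    the 2**k left endpoints carries mass 2**-k (diagonal terms are omitted)."""
    points = cantor_left_endpoints(k)
    mass = 2.0 ** -k
    total = 0.0
    for i, x in enumerate(points):
        for y in points[i + 1:]:
            total += abs(x - y) ** -s
    return 2 * total * mass * mass     # each unordered pair counts twice in (4.13)
```

Comparing `discrete_energy(0.5, k)` with `discrete_energy(0.8, k)` for increasing k shows the first levelling off and the second increasing by a factor of more than $3^{0.8}/2$ at each step, consistent with the dimension log 2/log 3 = 0.6309...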

Theorem 4.13 

Let F be a subset of $\mathbb{R}^n$.

(a) If there is a mass distribution $\mu$ on F with $I_s(\mu) < \infty$ then $\mathcal{H}^s(F) = \infty$
and $\dim_H F \geq s$.
(b) If F is a Borel set with $\mathcal{H}^s(F) > 0$ then there exists a mass distribution $\mu$
on F with $I_t(\mu) < \infty$ for all $0 < t < s$.

Proof 

(a) Suppose that $I_s(\mu) < \infty$ for some mass distribution $\mu$ with support
contained in F. Define

$$F_1 = \left\{x \in F : \limsup_{r \to 0} \mu(B(x, r))/r^s > 0\right\}.$$

If $x \in F_1$ we may find $\varepsilon > 0$ and a sequence of numbers $\{r_i\}$ decreasing to
0 such that $\mu(B(x, r_i)) \geq \varepsilon r_i^s$. Since $\mu(\{x\}) = 0$ (otherwise $I_s(\mu) = \infty$)
it follows from the continuity of $\mu$ that, by taking $q_i$ ($0 < q_i < r_i$) small
enough, we get $\mu(A_i) \geq \frac{1}{4}\varepsilon r_i^s$ (i = 1, 2, ...), where $A_i$ is the annulus
$B(x, r_i) \setminus B(x, q_i)$. Taking subsequences if necessary, we may assume that
$r_{i+1} < q_i$ for all i, so that the $A_i$ are disjoint annuli centred on x. Hence
for all $x \in F_1$

$$\phi_s(x) = \int \frac{d\mu(y)}{|x - y|^s} \geq \sum_{i=1}^{\infty} \int_{A_i} \frac{d\mu(y)}{|x - y|^s} \geq \sum_{i=1}^{\infty} \tfrac{1}{4}\varepsilon r_i^s r_i^{-s} = \infty$$

since $|x - y|^{-s} \geq r_i^{-s}$ on $A_i$. But $I_s(\mu) = \int \phi_s(x)\,d\mu(x) < \infty$, so
$\phi_s(x) < \infty$ for $\mu$-almost all x. We conclude that $\mu(F_1) = 0$. Since
$\limsup_{r \to 0} \mu(B(x, r))/r^s = 0$ if $x \in F \setminus F_1$, Proposition 4.9(a) tells us that,
for all c > 0, we have

$$\mathcal{H}^s(F) \geq \mathcal{H}^s(F \setminus F_1) \geq \mu(F \setminus F_1)/c \geq (\mu(F) - \mu(F_1))/c = \mu(F)/c.$$

Hence $\mathcal{H}^s(F) = \infty$.
(b) Suppose that $\mathcal{H}^s(F) > 0$. We use $\mathcal{H}^s$ to construct a mass distribution $\mu$
on F with $I_t(\mu) < \infty$ for every t < s.

By Corollary 4.12 there exist a compact set $E \subset F$ with $0 < \mathcal{H}^s(E) <
\infty$ and a constant b such that

$$\mathcal{H}^s(E \cap B(x, r)) \leq br^s$$

for all $x \in \mathbb{R}^n$ and r > 0. Let $\mu$ be the restriction of $\mathcal{H}^s$ to E, so that
$\mu(A) = \mathcal{H}^s(E \cap A)$; then $\mu$ is a mass distribution on F. Fix $x \in \mathbb{R}^n$
and write

$$m(r) = \mu(B(x, r)) = \mathcal{H}^s(E \cap B(x, r)) \leq br^s. \qquad (4.14)$$

Then, if $0 < t < s$

$$\phi_t(x) = \int_{|x-y| \leq 1} \frac{d\mu(y)}{|x - y|^t} + \int_{|x-y| > 1} \frac{d\mu(y)}{|x - y|^t}$$

$$\leq \int_0^1 r^{-t}\,dm(r) + \mu(\mathbb{R}^n)$$

$$= \left[r^{-t}m(r)\right]_{0+}^{1} + t\int_0^1 r^{-(t+1)}m(r)\,dr + \mu(\mathbb{R}^n)$$

$$\leq b + bt\int_0^1 r^{s-t-1}\,dr + \mu(\mathbb{R}^n)$$

$$= b\left(1 + \frac{t}{s - t}\right) + \mathcal{H}^s(E) = c,$$

say, after integrating by parts and using (4.14). Thus $\phi_t(x) \leq c$ for all
$x \in \mathbb{R}^n$, so that $I_t(\mu) = \int \phi_t(x)\,d\mu(x) \leq c\mu(\mathbb{R}^n) < \infty$. □

Important applications of Theorem 4.13 will be given later in the book, for
example, in the proof of the projection theorems in Chapter 6 and in the
determination of the dimension of Brownian paths in Chapter 16. The theorem is often
used to find the dimension of fractals $F_\theta$ which depend on a parameter $\theta$. There
may be a natural way to define a mass distribution $\mu_\theta$ on $F_\theta$ for each $\theta$. If we
can show that, for some s,

$$\int I_s(\mu_\theta)\,d\theta = \iiint \frac{d\mu_\theta(x)\,d\mu_\theta(y)\,d\theta}{|x - y|^s} < \infty,$$

then $I_s(\mu_\theta) < \infty$ for almost all $\theta$, so that $\dim_H F_\theta \geq s$ for almost all $\theta$.

Readers familiar with potential theory will have encountered the definition of
the s-capacity of a set F:

$$C_s(F) = \sup\{1/I_s(\mu) : \mu \text{ is a mass distribution on } F \text{ with } \mu(F) = 1\}$$

(with the convention that $1/\infty = 0$). Thus another way of expressing
Theorem 4.13 is

$$\dim_H F = \inf\{s \geq 0 : C_s(F) = 0\} = \sup\{s \geq 0 : C_s(F) > 0\}.$$

Whilst this is reminiscent of the definition (2.11) of Hausdorff dimension in terms
of Hausdorff measures, it should be noted that capacities behave very differently
from measures. In particular, they are not generally additive.

*4.4 Fourier transform methods

In this section, we do no more than indicate that Fourier transforms can be a 
powerful tool for analysing dimensions. 

The n-dimensional Fourier transforms of an integrable function f and a mass
distribution $\mu$ on $\mathbb{R}^n$ are defined by

$$\hat{f}(u) = \int_{\mathbb{R}^n} f(x)\exp(ix \cdot u)\,dx \qquad (u \in \mathbb{R}^n) \qquad (4.15)$$

$$\hat{\mu}(u) = \int_{\mathbb{R}^n} \exp(ix \cdot u)\,d\mu(x) \qquad (u \in \mathbb{R}^n) \qquad (4.16)$$

where $x \cdot u$ represents the usual scalar product. (Fourier transformation extends
to a much wider class of function using the theory of distributions.)
The s-potential (4.12) of a mass distribution $\mu$ is just the convolution

$$\phi_s(x) \equiv (|\cdot|^{-s} * \mu)(x) \equiv \int |x - y|^{-s}\,d\mu(y).$$

Formally, the transform of $|x|^{-s}$ may be shown to be $c|u|^{s-n}$, where c depends
on n and s, so the convolution theorem, which states that the transform of the
convolution of two functions equals the product of the transforms of the
functions, gives

$$\hat{\phi}_s(u) = c|u|^{s-n}\hat{\mu}(u).$$

Parseval's theorem tells us that

$$\int \phi_s(x)\,d\mu(x) = (2\pi)^n \int \hat{\phi}_s(u)\,\overline{\hat{\mu}(u)}\,du$$

where the bar denotes complex conjugation, so

$$I_s(\mu) = (2\pi)^n c \int |u|^{s-n}|\hat{\mu}(u)|^2\,du. \qquad (4.17)$$

This expression for $I_s(\mu)$, which may be established rather more rigorously, is
sometimes a convenient way of expressing the energy (4.13) required in
Theorem 4.13. Thus if there is a mass distribution $\mu$ on a set F for which the integral
(4.17) is finite, then $\dim_H F \geq s$. In particular, if

$$|\hat{\mu}(u)| \leq b|u|^{-t/2} \qquad (4.18)$$

for some constant b, then, noting that, by (4.16), $|\hat{\mu}(u)| \leq \mu(\mathbb{R}^n)$ for all u, we
have from (4.17) that

$$I_s(\mu) \leq c_1\int_{|u| \leq 1} |u|^{s-n}\,du + c_2\int_{|u| > 1} |u|^{s-n}|u|^{-t}\,du$$

which is finite if $0 < s < t$. Thus if (4.18) holds, any set F which supports $\mu$ has
Hausdorff dimension at least t. The greatest value of t for which there is a mass
distribution $\mu$ on F satisfying (4.18) is sometimes called the Fourier dimension
of F, which never exceeds the Hausdorff dimension.
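Condition (4.18) is a genuine restriction on the mass distribution chosen. As an illustration that goes slightly beyond the text, consider the natural mass distribution $\mu$ on the middle third Cantor set. It is the distribution of $\sum_{k \geq 1} a_k 3^{-k}$ with independent digits $a_k \in \{0, 2\}$, each value having probability $\frac{1}{2}$, so by (4.16) $|\hat{\mu}(u)| = \prod_{k \geq 1} |\cos(u/3^k)|$. The sketch below (function name and truncation are assumptions) evaluates this product at $u = 2\pi 3^m$, where every factor with $k \leq m$ has modulus 1, so the value does not depend on m and this particular $\mu$ satisfies (4.18) for no t > 0.

```python
import math

def cantor_measure_transform_abs(u, terms=60):
    """|mu_hat(u)| for the natural mass distribution on the middle third
    Cantor set, using the product formula prod_k |cos(u / 3**k)| truncated
    after `terms` factors (the omitted factors are extremely close to 1)."""
    value = 1.0
    for k in range(1, terms + 1):
        value *= abs(math.cos(u / 3 ** k))
    return value

# Sample the transform along u = 2*pi*3**m: the modulus does not decay.
samples = [cantor_measure_transform_abs(2 * math.pi * 3 ** m) for m in range(8)]
```

This does not contradict $\dim_H F = \log 2/\log 3$: it only shows that the Fourier decay criterion gives no information for this choice of $\mu$, while the energy integral of Theorem 4.13 still applies.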



4.5 Notes and references 

Many papers are devoted to calculating dimensions of various classes of fractal, 
for example the papers of Eggleston (1952), Beardon (1965) and Peyriere (1977) 
discuss fairly general constructions. 

The potential theoretic approach was, essentially, due to Frostman (1935); 
see Taylor (1961), Hayman and Kennedy (1976), Carleson (1967) or Kahane 
(1985) for more recent accounts. For an introduction to Fourier transforms see 
Papoulis (1962). 

The work on subsets of finite measure originates from Besicovitch (1952) 
and a very general treatment is given in Rogers (1998). Complete proofs of 
Theorem 4.10 may be found in Falconer (1985a) and Mattila (1995). 

Subsets of finite positive packing measure are investigated by Joyce and 
Preiss (1995). 



Exercises 



4.1 What is the Hausdorff dimension of the 'Cantor tartan' given by $\{(x, y) \in \mathbb{R}^2 :
\text{either } x \in F \text{ or } y \in F\}$ where F is the middle third Cantor set?

4.2 Use the mass distribution principle and a natural upper bound to show that the set of
numbers in [0, 1] containing only even digits has Hausdorff dimension log 5/log 10.

4.3 Use the mass distribution method to show that the 'Cantor dust' depicted in figure 0.4
has Hausdorff dimension 1. (Hint: note that, taking the square $E_0$ to have side 1,
any two squares in the set $E_k$ of the construction are separated by a distance of at
least $4^{-k}$.)

4.4 Fix $0 < \lambda \leq \frac{1}{2}$ and let F be the set of real numbers

$$F = \left\{\sum_{k=1}^{\infty} a_k\lambda^k : a_k = 0 \text{ or } 1 \text{ for } k = 1, 2, \ldots\right\}.$$

Find the Hausdorff and box dimensions of F.

4.5 What is the Hausdorff dimension of $F \times F \subset \mathbb{R}^2$, where F is the middle third
Cantor set?

4.6 Let F be the middle third Cantor set. What is the Hausdorff dimension of the plane
set given by $\{(x, y) \in \mathbb{R}^2 : x \in F \text{ and } 0 \leq y \leq x^2\}$?

4.7 Use a mass distribution method to obtain the result of Example 4.5 directly rather
than via Example 4.4.

4.8 Show that every number $x \geq 0$ may be expressed in the form

$$x = m + \frac{a_2}{2!} + \frac{a_3}{3!} + \cdots$$

where $m \geq 0$ is an integer and $a_k$ is an integer with $0 \leq a_k \leq k - 1$ for each k. Let
$F = \{x \geq 0 : m = 0 \text{ and } a_k \text{ is even for } k = 2, 3, \ldots\}$. Find $\dim_H F$.

4.9 Show that there is a compact subset F of [0, 1] of Hausdorff dimension 1 but
with $\mathcal{H}^1(F) = 0$. (Hint: try a 'Cantor set' construction, but reduce the proportion of
intervals removed at each stage.)

4.10 Deduce from Theorem 4.10 that if F is a Borel subset of $\mathbb{R}^n$ with $\mathcal{H}^s(F) = \infty$ and
c is a positive number, then there is a Borel subset E of F with $\mathcal{H}^s(E) = c$.

4.11 Let $\mu$ be the natural mass distribution on the middle third Cantor set F (see after
Principle 4.2). Estimate the s-energy of $\mu$ for s < log 2/log 3 and deduce from
Theorem 4.13 that $\dim_H F \geq \log 2/\log 3$.



Chapter 5 Local structure of fractals 



Classical calculus involves finding local approximations to curves and surfaces 
by tangent lines and planes. Viewed on a large scale the neighbourhood of a 
point on a smooth curve appears close to a line segment. Can we say anything 
about the local structure of as diverse a class of objects as fractals? Surprisingly, 
the answer in many cases is yes. We can go some way to establishing the form 
of fractals in a neighbourhood of a general point. In particular, we can study the 
concentration of fractals about typical points; in other words, their local densities, 
and the directional distribution of fractals around points including the question 
of whether tangents exist. A knowledge of the local form of a fractal is useful 
both in developing theory and in applications. 

In order to realize the power of Hausdorff measures, it is necessary to
restrict attention to s-sets, i.e. Borel sets of Hausdorff dimension s with positive
finite s-dimensional Hausdorff measure. (More generally, it is possible to work
with s-sets of positive finite $\mathcal{H}^h$-measure for some dimension function h;
see Section 2.5. We do not consider this generalization here.) This is not so
restrictive as it first appears. Many fractals encountered in practice are s-sets,
but even if $\mathcal{H}^s(F) = \infty$ then, by Theorem 4.10, F has subsets that are s-sets
to which this theory can be applied. Alternatively, it sometimes happens that a
set F of dimension s is a countable union of s-sets, and the properties of these
component sets can often be transferred to F.

The material outlined in this chapter lies at the heart of geometric measure 
theory, a subject where rigorous proofs are often intricate and difficult. We omit 
the harder proofs here; it is hoped that those included will be found instructive 
and will give the flavour of the subject. We generally restrict attention to subsets 
of the plane — the higher-dimensional analogues, though valid, are often an order 
of magnitude harder. 

5.1 Densities 

Let F be a subset of the plane. The density of F at x is

$$\lim_{r \to 0} \frac{\mathrm{area}(F \cap B(x, r))}{\mathrm{area}(B(x, r))} = \lim_{r \to 0} \frac{\mathrm{area}(F \cap B(x, r))}{\pi r^2} \qquad (5.1)$$

where B(x, r) is the closed disc of radius r and centre x. The classical Lebesgue
density theorem tells us that, for a Borel set F, this limit exists and equals 1
when $x \in F$ and 0 when $x \notin F$, except for a set of x of area 0. In other words,
for a typical point x of F, small discs centred at x are almost entirely filled by
F, but if x is outside F then small discs centred at x generally contain very little
of F; see figure 5.1.

Similarly, if F is a smooth curve in the plane and x is a point of F (other than
an endpoint), then $F \cap B(x, r)$ is close to a diameter of B(x, r) for small r and

$$\lim_{r \to 0} \frac{\mathrm{length}(F \cap B(x, r))}{2r} = 1.$$

If $x \notin F$ then this limit is clearly 0.

Density theorems such as these tell us how much of the set F, in the sense of
area or length, is concentrated near x. In the same way it is natural to investigate
densities of fractals: if F has dimension s, how does the s-dimensional Hausdorff
measure of $F \cap B(x, r)$ behave as $r \to 0$? We look at this question when
F is an s-set in $\mathbb{R}^2$ with $0 < s < 2$ (0-sets are just finite sets of points, and there
is little to say, and $\mathcal{H}^2$ is essentially area, so if s = 2 we are in the Lebesgue
density situation (5.1)).

We define the lower and upper densities of an s-set F at a point $x \in \mathbb{R}^n$ as

$$\underline{D}^s(F, x) = \liminf_{r \to 0} \frac{\mathcal{H}^s(F \cap B(x, r))}{(2r)^s} \qquad (5.2)$$

and

$$\overline{D}^s(F, x) = \limsup_{r \to 0} \frac{\mathcal{H}^s(F \cap B(x, r))}{(2r)^s} \qquad (5.3)$$

respectively (note that $|B(x, r)| = 2r$). If $\underline{D}^s(F, x) = \overline{D}^s(F, x)$ we say that the
density of F at x exists and we write $D^s(F, x)$ for the common value.

Figure 5.1 The Lebesgue density theorem. The point x is in F, and area(F ∩ B(x, r))/area(B(x, r)) is close to 1 if r is small. The point y is outside F, and area(F ∩ B(y, r))/area(B(y, r)) is close to 0 if r is small

A point x at which $\underline{D}^s(F, x) = \overline{D}^s(F, x) = 1$ is called a regular point of F,
otherwise x is an irregular point. An s-set is termed regular if $\mathcal{H}^s$-almost all of
its points (i.e. all of its points except for a set of $\mathcal{H}^s$-measure 0) are regular, and
irregular if $\mathcal{H}^s$-almost all of its points are irregular. (Here 'irregular' does not
mean 'not regular'!) As we shall see, a fundamental result is that an s-set F must
be irregular unless s is an integer. However, if s is integral an s-set decomposes
into a regular and an irregular part. Roughly speaking, a regular 1-set consists of
portions of rectifiable curves of finite length, whereas an irregular 1-set is totally
disconnected and dust-like, and typically of fractal form.

By definition, a regular set is one for which the direct analogue of the Lebesgue 
density theorem holds. However, even the densities of irregular sets cannot 
behave too erratically. 

Proposition 5.1 

Let F be an s-set in $\mathbb{R}^n$. Then

(a) $\underline{D}^s(F, x) = \overline{D}^s(F, x) = 0$ for $\mathcal{H}^s$-almost all $x \notin F$

(b) $2^{-s} \leq \overline{D}^s(F, x) \leq 1$ for $\mathcal{H}^s$-almost all $x \in F$.

Partial proof

(a) If F is closed and $x \notin F$, then $B(x, r) \cap F = \emptyset$ if r is small enough.
Hence $\lim_{r \to 0} \mathcal{H}^s(F \cap B(x, r))/(2r)^s = 0$. If F is not closed the proof is
a little more involved and we omit it here.

(b) This follows quickly from Proposition 4.9(a) by taking $\mu$ as the restriction
of $\mathcal{H}^s$ to F, i.e. $\mu(A) = \mathcal{H}^s(F \cap A)$: if

$$F_1 = \left\{x \in F : \overline{D}^s(F, x) = \limsup_{r \to 0} \frac{\mathcal{H}^s(F \cap B(x, r))}{(2r)^s} < 2^{-s}c\right\}$$

then $\mathcal{H}^s(F_1) \geq \mu(F_1)/c = \mathcal{H}^s(F_1)/c$. If $0 < c < 1$ this is only possible if
$\mathcal{H}^s(F_1) = 0$; thus for almost all $x \in F$ we have $\overline{D}^s(F, x) \geq 2^{-s}$. The upper
bound follows in essentially the same way using Proposition 4.9(b). □

Note that an immediate consequence of Proposition 5.1(b) is that an irregular 
set has a lower density which is strictly less than 1 almost everywhere. 
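The oscillation of the density ratio can be seen explicitly at the point x = 0 of the middle third Cantor set F. The sketch below rests on an assumption not established in this section, namely that $\mathcal{H}^s(F) = 1$ for s = log 2/log 3, so that $\mathcal{H}^s$ restricted to F coincides with the natural mass distribution and $\mathcal{H}^s(F \cap B(0, r))$ equals the Cantor function at r. The function names are illustrative. Along $r = 3^{-k}$ the ratio equals $2^{-s}$, and along $r = 2 \cdot 3^{-k}$ it equals $4^{-s}$, so the lower and upper densities at 0 differ.

```python
from fractions import Fraction
import math

S = math.log(2) / math.log(3)          # dimension of the middle third Cantor set

def cantor_cdf(x, depth=60):
    """Cantor function: mass of [0, x] under the natural mass distribution."""
    x = Fraction(x)
    if x <= 0:
        return Fraction(0)
    if x >= 1:
        return Fraction(1)
    total, weight = Fraction(0), Fraction(1, 2)
    for _ in range(depth):
        if x < Fraction(1, 3):
            x = 3 * x
        elif x > Fraction(2, 3):
            total += weight
            x = 3 * x - 2
        else:
            return total + weight
        weight /= 2
    return total

def density_ratio_at_zero(r):
    """mu(B(0, r)) / (2r)^s, standing in for H^s(F ∩ B(0, r)) / (2r)^s under
    the assumption that H^s restricted to F is the natural mass distribution."""
    return float(cantor_cdf(r)) / (2 * float(r)) ** S

along_powers = [density_ratio_at_zero(Fraction(1, 3 ** k)) for k in range(1, 11)]
along_doubles = [density_ratio_at_zero(Fraction(2, 3 ** k)) for k in range(1, 11)]
```

The values in `along_powers` agree with $2^{-s}$, the lower bound for the upper density in Proposition 5.1(b), while those in `along_doubles` agree with $4^{-s}$, which is strictly smaller.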

We will sometimes need to relate the densities of a set to those of certain
subsets. Let F be an s-set and let E be a Borel subset of F. Then

$$\frac{\mathcal{H}^s(F \cap B(x, r))}{(2r)^s} = \frac{\mathcal{H}^s(E \cap B(x, r))}{(2r)^s} + \frac{\mathcal{H}^s((F \setminus E) \cap B(x, r))}{(2r)^s}.$$

For almost all x in E, we have

$$\frac{\mathcal{H}^s((F \setminus E) \cap B(x, r))}{(2r)^s} \to 0 \quad \text{as } r \to 0$$

by Proposition 5.1(a), so letting $r \to 0$ gives

$$\underline{D}^s(F, x) = \underline{D}^s(E, x); \qquad \overline{D}^s(F, x) = \overline{D}^s(E, x) \qquad (5.4)$$

for $\mathcal{H}^s$-almost all x in E. Thus, from the definitions of regularity, if E is a subset
of an s-set F with $\mathcal{H}^s(E) > 0$, then E is regular if F is regular and E is irregular
if F is irregular. In particular, the intersection of a regular and an irregular set,
being a subset of both, has measure zero.

Estimates for lower densities are altogether harder to obtain, and we do not 
pursue them here. 

In general it is quite involved to show that $s$-sets of non-integral dimension are
irregular, but in the case $0 < s < 1$ the following 'annulus' proof is appealing.

Theorem 5.2 

Let $F$ be an $s$-set in $\mathbb{R}^2$. Then $F$ is irregular unless $s$ is an integer.

Partial proof. We show that $F$ is irregular if $0 < s < 1$ by showing that the density
$D^s(F, x)$ fails to exist almost everywhere in $F$. Suppose to the contrary: then
there is a set $F_1 \subset F$ of positive measure where the density exists and therefore
where $\frac{1}{2} < 2^{-s} \le D^s(F, x)$, by Proposition 5.1(b). By Egoroff's theorem (see
Section 1.3) we may find $r_0 > 0$ and a Borel set $E \subset F_1 \subset F$ with $\mathcal{H}^s(E) > 0$
such that

$$\mathcal{H}^s(F \cap B(x, r)) > \tfrac{1}{2}(2r)^s \tag{5.5}$$

for all $x \in E$ and $r < r_0$. Let $y \in E$ be a cluster point of $E$ (i.e. a point $y$ with
other points of $E$ arbitrarily close). Let $\eta$ be a number with $0 < \eta < 1$ and let
$A_{r,\eta}$ be the annulus $B(y, r(1 + \eta)) \setminus B(y, r(1 - \eta))$; see figure 5.2. Then

$$(2r)^{-s}\mathcal{H}^s(F \cap A_{r,\eta}) = (2r)^{-s}\mathcal{H}^s(F \cap B(y, r(1 + \eta))) - (2r)^{-s}\mathcal{H}^s(F \cap B(y, r(1 - \eta)))$$
$$\to D^s(F, y)\big((1 + \eta)^s - (1 - \eta)^s\big) \tag{5.6}$$

as $r \to 0$. For a sequence of values of $r$ tending to 0, we may find $x \in E$ with
$|x - y| = r$. Then $B(x, \frac{1}{2}r\eta) \subset A_{r,\eta}$, so by (5.5)

$$\tfrac{1}{2} r^s \eta^s < \mathcal{H}^s(F \cap B(x, \tfrac{1}{2}r\eta)) \le \mathcal{H}^s(F \cap A_{r,\eta}).$$

Figure 5.2 The 'annulus' proof of Theorem 5.2

Combining with (5.6) this implies that

$$2^{-s-1}\eta^s \le D^s(F, y)\big((1 + \eta)^s - (1 - \eta)^s\big) = D^s(F, y)\big(2s\eta + \text{terms in } \eta^2 \text{ or higher}\big).$$

Letting $\eta \to 0$ we see that this is impossible when $0 < s < 1$ and the result
follows by contradiction. □
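
The last step of the annulus argument can be checked numerically. The proof ends with the inequality $2^{-s-1}\eta^s \le D^s(F, y)((1+\eta)^s - (1-\eta)^s)$, where the density is at most 1 by Proposition 5.1(b). The following Python sketch (the function names `annulus_gap` and `violating_eta` are ours, chosen for illustration) takes the most favourable case, density 1, and halves $\eta$ until the inequality fails. It is only an illustration of why small $\eta$ gives a contradiction when $0 < s < 1$, not a substitute for the proof.

```python
import math

def annulus_gap(s, eta, density=1.0):
    """Left side minus right side of 2^(-s-1) eta^s <= D((1+eta)^s - (1-eta)^s).

    A positive value means the inequality is violated for this eta.
    expm1/log1p keep the difference (1+eta)^s - (1-eta)^s accurate for tiny eta.
    """
    lhs = 2.0 ** (-s - 1.0) * eta ** s
    rhs = density * (math.expm1(s * math.log1p(eta)) - math.expm1(s * math.log1p(-eta)))
    return lhs - rhs

def violating_eta(s, density=1.0, max_halvings=200):
    """Halve eta from 1/2 until the inequality fails; None if it never fails."""
    eta = 0.5
    for _ in range(max_halvings):
        if annulus_gap(s, eta, density) > 0:
            return eta
        eta /= 2.0
    return None

if __name__ == "__main__":
    for s in (0.25, 0.5, 0.75, 1.0):
        print(s, violating_eta(s))
```

For $s = 1$ the left side is $\eta/4$ and the right side is $2\eta$, so no violating $\eta$ exists and the search returns `None`, in line with the fact that the argument only works for $0 < s < 1$.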



5.2 Structure of 1-sets 

As we have pointed out, sets of non-integral dimension must be irregular. The 
situation for sets of integral dimension is more complicated. The following 
decomposition theorem, indicated in figure 5.3, enables us to split a 1-set into 
a regular and an irregular part, so that we can analyse each separately, and 
recombine them without affecting density properties. 



Decomposition theorem 5.3 

Let F be a 1-set. The set of regular points of F forms a regular set, the set of 
irregular points forms an irregular set. 
Figure 5.3 Decomposition of a 1-set into a regular 'curve-like' part and an irregular
'curve-free' part

Proof. This is immediate from (5.4), taking E as the set of regular and irregular 
points respectively. □ 

Examples of regular and irregular 1-sets abound. Smooth curves are regular, 
and provide us with the shapes of classical geometry such as the perimeters of 
circles or ellipses. On the other hand the iterated construction of figure 0.4 gives 
an irregular 1-set which is a totally disconnected fractal. This is typical — as we 
shall see, regular 1-sets are made up from pieces of curve, whereas irregular 
1-sets are dust-like and 'curve-free', i.e. intersect any (finite length) curve in 
length zero. 

To study 1-sets we need a few facts about curves. For our purposes a curve 
or Jordan curve C is the image of a continuous injection (one-to-one function) 
$\psi : [a, b] \to \mathbb{R}^2$, where $[a, b] \subset \mathbb{R}$ is a proper closed interval. According to our
definition, curves are not self-intersecting, have two ends, and are compact connected
subsets of the plane. The length $\mathcal{L}(C)$ of the curve $C$ is given by polygonal
approximation:

$$\mathcal{L}(C) = \sup \sum_{i=1}^{m} |x_i - x_{i-1}|$$

where the supremum is taken over all dissections of $C$ by points $x_0, x_1, \ldots, x_m$
in that order along the curve. If the length $\mathcal{L}(C)$ is positive and finite we call $C$
a rectifiable curve. 
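
As a small illustration of length by polygonal approximation, the Python sketch below computes the polygonal sums for equally spaced dissections of a unit semicircle (a curve with two distinct ends, as the definition requires). The choice of curve and the function names are ours. The sums increase as the dissection is refined and approach the length $\pi$ from below, as the supremum in the definition suggests.

```python
import math

def polygonal_length(points):
    """Sum of |x_i - x_{i-1}| over consecutive points of a dissection."""
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))

def semicircle_dissection(m):
    """m + 1 equally spaced points along the upper unit semicircle, in order."""
    return [(math.cos(math.pi * i / m), math.sin(math.pi * i / m)) for i in range(m + 1)]

if __name__ == "__main__":
    for m in (1, 2, 8, 64, 1024):
        print(m, polygonal_length(semicircle_dissection(m)))
```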

As one might expect, the length of a curve equals its 1-dimensional Hausdorff
measure.

Lemma 5.4

If $C$ is a rectifiable curve then $\mathcal{H}^1(C) = \mathcal{L}(C)$.

Proof. For $x, y \in C$, let $C_{x,y}$ denote that part of $C$ between $x$ and $y$. Since orthogonal
projection onto the line through $x$ and $y$ does not increase distances, (2.9)
gives $\mathcal{H}^1(C_{x,y}) \ge \mathcal{H}^1[x, y] = |x - y|$, where $[x, y]$ is the straight-line segment
joining $x$ to $y$. Hence for any dissection $x_0, x_1, \ldots, x_m$ of $C$,

$$\sum_{i=1}^{m} |x_i - x_{i-1}| \le \sum_{i=1}^{m} \mathcal{H}^1(C_{x_{i-1}, x_i}) \le \mathcal{H}^1(C)$$

so that $\mathcal{L}(C) \le \mathcal{H}^1(C)$. On the other hand, let $f : [0, \mathcal{L}(C)] \to C$ be the mapping
that takes $t$ to the point on $C$ at distance $t$ along the curve from one of its ends.
Clearly $|f(t) - f(u)| \le |t - u|$ for $0 \le t, u \le \mathcal{L}(C)$, that is $f$ is Lipschitz and
$\mathcal{H}^1(C) \le \mathcal{H}^1[0, \mathcal{L}(C)] = \mathcal{L}(C)$ by (2.9) as required. □

It is straightforward to show that rectifiable curves are regular. 

Lemma 5.5 

A rectifiable curve is a regular 1-set. 

Proof. If $C$ is rectifiable, $\mathcal{L}(C) < \infty$, and since $C$ has distinct endpoints $p$ and $q$,
we get $\mathcal{L}(C) \ge |p - q| > 0$. By Lemma 5.4, $0 < \mathcal{H}^1(C) < \infty$, so $C$ is a 1-set.
A point $x$ of $C$ that is not an endpoint divides $C$ into two parts $C_{p,x}$ and
$C_{x,q}$. If $r$ is sufficiently small, then moving away from $x$ along the curve $C_{x,q}$
we reach a first point $y$ on $C$ with $|x - y| = r$. Then $C_{x,y} \subset B(x, r)$ and

$$r = |x - y| \le \mathcal{L}(C_{x,y}) = \mathcal{H}^1(C_{x,y}) \le \mathcal{H}^1(C_{x,q} \cap B(x, r)).$$

Similarly, $r \le \mathcal{H}^1(C_{p,x} \cap B(x, r))$, so, adding, $2r \le \mathcal{H}^1(C \cap B(x, r))$, if $r$ is
small enough. Thus

$$\underline{D}^1(C, x) = \liminf_{r \to 0} \frac{\mathcal{H}^1(C \cap B(x, r))}{2r} \ge 1.$$

By Proposition 5.1(b) $\underline{D}^1(C, x) \le \overline{D}^1(C, x) \le 1$, so $D^1(C, x)$ exists and equals
1 for all $x \in C$ other than the two endpoints, so $C$ is regular. □

Other regular sets are easily constructed. By (5.4), subsets of regular sets, and 
unions of regular sets should also be regular. With this in mind we define a 1-set 
to be curve-like if it is contained in a countable union of rectifiable curves. 



Proposition 5.6 

A curve-like 1-set is a regular 1-set. 
Proof. If $F$ is a curve-like 1-set then $F \subset \bigcup_{i=1}^{\infty} C_i$ where the $C_i$ are rectifiable
curves. For each $i$ and $\mathcal{H}^1$-almost all $x \in F \cap C_i$ we have, using Lemma 5.5 and
equation (5.4),

$$1 = \underline{D}^1(C_i, x) = \underline{D}^1(F \cap C_i, x) \le \underline{D}^1(F, x)$$

and hence $1 \le \underline{D}^1(F, x)$ for almost all $x \in F$. But for almost all $x \in F$ we
have $\underline{D}^1(F, x) \le \overline{D}^1(F, x) \le 1$ by Proposition 5.1, so $D^1(F, x) = 1$ almost
everywhere, and $F$ is regular. □

It is natural to introduce a complementary definition: a 1-set is called curve-free 
if its intersection with every rectifiable curve has $\mathcal{H}^1$-measure zero.

Proposition 5.7 

An irregular 1-set is curve-free. 

Proof. If $F$ is irregular and $C$ is a rectifiable curve then $F \cap C$ is a subset of
both a regular and an irregular set, so has zero $\mathcal{H}^1$-measure. □

These two propositions begin to suggest that regular and irregular sets might be 
characterized as curve-like and curve-free respectively. This is indeed the case, 
but it is far from easy to prove. The crux of the matter is the following lower- 
density estimate, which depends on an intricate investigation of the properties of 
curves and connected sets and some ingenious geometrical arguments. 

Proposition 5.8 

Let $F$ be a curve-free 1-set in $\mathbb{R}^2$. Then $\underline{D}^1(F, x) \le \frac{3}{4}$ at almost all $x \in F$.

Proof. Omitted. □ 

Assuming this proposition, a complete characterization of regular and irregular 
sets is relatively easy. 

Theorem 5.9 

(a) A 1-set in $\mathbb{R}^2$ is irregular if and only if it is curve-free.

(b) A 1-set in $\mathbb{R}^2$ is regular if and only if it is the union of a curve-like set and
a set of $\mathcal{H}^1$-measure zero.

Proof 

(a) A curve-free set must be irregular by Proposition 5.8. Proposition 5.7 pro- 
vides the converse implication. 

(b) By Proposition 5.6 a curve-like set is regular, and adding in a set of 
measure zero does not affect densities or, therefore, regularity. 
If $F$ is regular, then any Borel subset $E$ of positive measure is regular with
$\underline{D}^1(E, x) = 1$ for almost all $x \in E$. By Proposition 5.8 the set $E$ cannot
be curve-free, so some rectifiable curve intersects $E$ in a set of positive
length. We use this fact to define inductively a sequence of rectifiable
curves $\{C_i\}$. We choose $C_1$ to cover a reasonably large part of $F$, say

$$\mathcal{H}^1(F \cap C_1) \ge \tfrac{1}{2} \sup\{\mathcal{H}^1(F \cap C) : C \text{ is rectifiable}\} > 0.$$

If $C_1, \ldots, C_k$ have been selected and $F_k = F \setminus \bigcup_{i=1}^{k} C_i$ has positive measure,
let $C_{k+1}$ be a rectifiable curve for which

$$\mathcal{H}^1(F_k \cap C_{k+1}) \ge \tfrac{1}{2} \sup\{\mathcal{H}^1(F_k \cap C) : C \text{ is rectifiable}\} > 0. \tag{5.7}$$

If the process terminates then for some $k$ the curves $C_1, \ldots, C_k$ cover
almost all of $F$ and $F$ is curve-like. Otherwise,

$$\infty > \mathcal{H}^1(F) \ge \sum_{k} \mathcal{H}^1(F_k \cap C_{k+1})$$

since the $F_k \cap C_{k+1}$ are disjoint, so that $\mathcal{H}^1(F_k \cap C_{k+1}) \to 0$ as $k \to \infty$.
If $\mathcal{H}^1(F \setminus \bigcup_{i=1}^{\infty} C_i) > 0$ there is a rectifiable curve $C$ such that
$\mathcal{H}^1((F \setminus \bigcup_{i=1}^{\infty} C_i) \cap C) = d$ for some $d > 0$. But $\mathcal{H}^1(F_k \cap C_{k+1}) < \frac{1}{2}d$ for some
$k$, so, according to (5.7), $C$ would have been selected in preference to
$C_{k+1}$. Hence $\mathcal{H}^1(F \setminus \bigcup_{i=1}^{\infty} C_i) = 0$, and $F$ consists of the curve-like set
$F \cap \bigcup_{i=1}^{\infty} C_i$ together with $F \setminus \bigcup_{i=1}^{\infty} C_i$, which is of measure zero. □

Thus regular 1-sets are essentially unions of subsets of rectifiable curves, but 
irregular 1-sets contain no pieces of rectifiable curves at all. This dichotomy is 
remarkable in that the definition of regularity is purely in terms of densities and 
makes no reference to curves. Propositions 5.6 and 5.8 provide a further contrast. 
Almost everywhere, a regular set has lower density 1, whereas an irregular set 
has lower density at most $\frac{3}{4}$. Thus in any 1-set $F$ the set of points for which
$\frac{3}{4} < \underline{D}^1(F, x) < 1$ has $\mathcal{H}^1$-measure zero.

Regular 1-sets may be connected but, like sets of dimension less than 1, irreg- 
ular 1-sets must be totally disconnected. We know at least that distinct points 
cannot be joined by a rectifiable curve in an irregular set, and further investigation 
shows that no two points can lie in the same connected component. 

Further differences between regular and irregular sets include the existence of 
tangents (see Section 5.3) and projection properties (see Chapter 6). In all these 
ways, the classes of regular and irregular 1-sets are distanced from each other. 
For the special case of 1-sets, it would make sense mathematically to define 
fractals to be those sets which are irregular. 



5.3 Tangents to s-sets 

Suppose that a smooth curve C has a tangent (in the classical sense) at x. This 
means that close to x the set C is concentrated in two diametrically opposite 
directions. What can be said about the directional distribution of an s-set about a
typical point? Is there a meaningful definition of a tangent to an s-set, and when 
do such tangents exist? 

Any generalization of the definition of tangents should reflect the local direc- 
tional distribution of sets of positive measure — for sets of the complexity that 
we have in mind, there is no hope of a definition involving all nearby points; 
we must be content with a condition on almost all points. We say that an s-set 
$F$ in $\mathbb{R}^n$ has a tangent at $x$ in direction $\theta$ ($\theta$ a unit vector) if

$$\overline{D}^s(F, x) > 0 \tag{5.8}$$

and, for every angle $\varphi > 0$,

$$\lim_{r \to 0} r^{-s}\mathcal{H}^s(F \cap (B(x, r) \setminus S(x, \theta, \varphi))) = 0 \tag{5.9}$$

where $S(x, \theta, \varphi)$ is the double sector with vertex $x$, consisting of those $y$ such
that the line segment $[x, y]$ makes an angle at most $\varphi$ with $\theta$ or $-\theta$; see figure 5.4.
Thus, for a tangent in direction $\theta$, (5.8) requires that a significant part of $F$ lies
near $x$, of which, by (5.9), a negligible amount close to $x$ lies outside any double
sector $S(x, \theta, \varphi)$; see figure 5.5.

We first discuss tangents to regular 1-sets in the plane, a situation not far 
removed from the classical calculus of curves. 

Proposition 5.10 

A rectifiable curve C has a tangent at almost all of its points. 



Proof. By Lemma 5.5 the upper density $\overline{D}^1(C, x) = 1 > 0$ for almost all $x \in C$.
We may reparametrize the defining function of $C$ by arc length, so that
$\psi : [0, \mathcal{L}(C)] \to \mathbb{R}^2$ gives $\psi(t)$ as the point distance $t$ along $C$ from the endpoint
$\psi(0)$. To say that $\mathcal{L}(C) < \infty$ simply means that $\psi$ has bounded variation,
in other words $\sup \sum_{i=1}^{m} |\psi(t_i) - \psi(t_{i-1})| < \infty$ where the supremum is over
dissections $0 = t_0 < t_1 < \cdots < t_m = \mathcal{L}(C)$. We quote a standard result from the
theory of functions, that functions of bounded variation are differentiable almost
everywhere, so $\psi'(t)$ exists as a vector for almost all $t$. Because of the arc-length
parametrization, $|\psi'(t)| = 1$ for such $t$. Hence at almost all points $\psi(t)$
on $C$, there exists a unit vector $\theta$ such that $\lim_{u \to t} (\psi(u) - \psi(t))/(u - t) = \theta$.
Thus, given $\varphi > 0$, there is a number $\varepsilon > 0$ such that $\psi(u) \in S(\psi(t), \theta, \varphi)$
whenever $|u - t| < \varepsilon$. Since $C$ has no double points we may find $r$ such that
$\psi(u) \notin B(\psi(t), r)$ if $|u - t| \ge \varepsilon$, so $C \cap (B(\psi(t), r) \setminus S(\psi(t), \theta, \varphi))$ is empty.
By the definitions (5.8) and (5.9), the curve $C$ has a tangent at $\psi(t)$. Such points
account for almost all points on $C$. □

Figure 5.4 The double sector $S(x, \theta, \varphi)$

Figure 5.5 For $F$ to have a tangent in direction $\theta$ at $x$, there must be a negligible part
of $F$ in $B(x, r) \setminus S(x, \theta, \varphi)$ (shaded) for small $r$

Just as with densities, we can transfer tangency properties from curves to 
curve-like sets. 



Proposition 5.11 

A regular 1-set $F$ in $\mathbb{R}^2$ has a tangent at almost all of its points.

Proof. By definition of regularity, $\overline{D}^1(F, x) = 1 > 0$ at almost all $x \in F$.
If $C$ is any rectifiable curve, then for almost all $x$ in $C$ there exists $\theta$ such that
if $\varphi > 0$

$$\lim_{r \to 0} r^{-1}\mathcal{H}^1((F \cap C) \cap (B(x, r) \setminus S(x, \theta, \varphi))) \le \lim_{r \to 0} r^{-1}\mathcal{H}^1(C \cap (B(x, r) \setminus S(x, \theta, \varphi))) = 0$$

by Proposition 5.10. Moreover

$$\lim_{r \to 0} r^{-1}\mathcal{H}^1((F \setminus C) \cap (B(x, r) \setminus S(x, \theta, \varphi))) \le \lim_{r \to 0} r^{-1}\mathcal{H}^1((F \setminus C) \cap B(x, r)) = 0$$

for almost all $x \in C$ by Proposition 5.1(a). Adding these inequalities

$$\lim_{r \to 0} r^{-1}\mathcal{H}^1(F \cap (B(x, r) \setminus S(x, \theta, \varphi))) = 0$$

for almost all $x \in C$ and so for almost all $x \in F \cap C$. Since a countable collection
of such curves covers almost all of $F$, the result follows. □

In contrast to regular sets, irregular 1-sets do not generally support tangents. 

Proposition 5.12 

At almost all points of an irregular 1-set, no tangent exists.

Proof. The proof, which depends on the characterization of irregular sets as curve- 
free sets, is too involved to include here. □ 

We turn now to $s$-sets in $\mathbb{R}^2$ for non-integral $s$, which, as we have seen,
are necessarily irregular. For $0 < s < 1$ tangency questions are not particularly
interesting, since any set contained in a smooth curve will automatically satisfy
(5.9) with $\theta$ the direction of the tangent to the curve at $x$. For example, the middle
third Cantor set $F$ regarded as a subset of the plane is a $(\log 2/\log 3)$-set that
satisfies (5.8) and (5.9) for all $x$ in $F$ and $\varphi > 0$, where $\theta$ is a vector pointing
along the set. On the other hand, if $F$, say, is a Cartesian product of two uniform
Cantor sets, each formed by repeated removal of a proportion $\alpha > \frac{1}{2}$ from the
centre of intervals, then a little calculation (see Chapter 7) shows that $F$ is an
$s$-set with $s = 2\log 2/\log(2/(1 - \alpha)) < 1$ with no tangents at any of its points.
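
The dimension formula for the product of two uniform Cantor sets is easy to evaluate. The Python sketch below (the function name is ours) computes $s = 2\log 2/\log(2/(1-\alpha))$ and shows that the value drops below 1 exactly when the removed proportion $\alpha$ exceeds $\frac{1}{2}$; with $\alpha = \frac{1}{3}$ it gives twice the dimension of the middle third Cantor set.

```python
import math

def product_cantor_dimension(alpha):
    """s = 2 log 2 / log(2 / (1 - alpha)) for removed proportion 0 < alpha < 1."""
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    return 2.0 * math.log(2.0) / math.log(2.0 / (1.0 - alpha))

if __name__ == "__main__":
    for alpha in (1 / 3, 0.4, 0.5, 0.6, 0.8):
        print(alpha, product_cantor_dimension(alpha))
```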

It is at least plausible that $s$-sets in $\mathbb{R}^2$ with $1 < s < 2$ do not have tangents:
such sets are so large that they radiate in many directions from a typical
point, so that (5.9) cannot hold. This is made precise in the following proposition.

Proposition 5.13

If $F$ is an $s$-set in $\mathbb{R}^2$ with $1 < s < 2$, then at almost all points of $F$, no tangent
exists.

Proof. For $r_0 > 0$ let

$$E = \{y \in F : \mathcal{H}^s(F \cap B(y, r)) < 2(2r)^s \text{ for all } r < r_0\}. \tag{5.10}$$

For each $x \in E$, each unit vector $\theta$ and each angle $\varphi$ with $0 < \varphi < \frac{1}{2}\pi$, we
estimate how much of $E$ lies in $B(x, r) \cap S(x, \theta, \varphi)$. For $r < r_0/20$ and $i =
1, 2, \ldots$ let $A_i$ be the intersection of the annulus and the double sector given by

$$A_i = (B(x, ir\varphi) \setminus B(x, (i - 1)r\varphi)) \cap S(x, \theta, \varphi).$$

Then $B(x, r) \cap S(x, \theta, \varphi) \subset \bigcup_{i=1}^{m} A_i \cup \{x\}$ for some integer $m < 2/\varphi$. Each $A_i$
comprises two parts, both of diameter at most $10r\varphi < r_0$, so applying (5.10) to
the parts that contain points of $E$, and summing,

$$\mathcal{H}^s(E \cap B(x, r) \cap S(x, \theta, \varphi)) \le 2m \cdot 2(20r\varphi)^s \le (4\varphi^{-1})\, 2(20r\varphi)^s$$

so that

$$(2r)^{-s}\mathcal{H}^s(E \cap B(x, r) \cap S(x, \theta, \varphi)) \le 8 \cdot 10^s \varphi^{s-1} \tag{5.11}$$

if $r < r_0/20$.

Now, almost all $x \in E$ satisfy $\overline{D}^s(F \setminus E, x) = 0$ by Proposition 5.1(a). Decomposing
$F \cap B(x, r)$ into three parts we get

$$\mathcal{H}^s(F \cap B(x, r)) = \mathcal{H}^s((F \setminus E) \cap B(x, r)) + \mathcal{H}^s(E \cap B(x, r) \cap S(x, \theta, \varphi)) + \mathcal{H}^s(E \cap (B(x, r) \setminus S(x, \theta, \varphi))).$$

Dividing by $(2r)^s$ and taking upper limits as $r \to 0$,

$$\overline{D}^s(F, x) \le 0 + 8 \cdot 10^s \varphi^{s-1} + \limsup_{r \to 0}\, (2r)^{-s}\mathcal{H}^s(F \cap (B(x, r) \setminus S(x, \theta, \varphi)))$$

for almost all $x \in E$, using (5.11). Choosing $\varphi$ sufficiently small, it follows that
for all $\theta$ (5.8) and (5.9) cannot both hold, so no tangent exists at such $x$. To
complete the proof, we note that almost all $x \in F$ belong to the set $E$ defined in
(5.10) for some $r_0 > 0$, by Proposition 5.1(b). □
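
To see how small $\varphi$ has to be, recall from Proposition 5.1(b) that the upper density is at least $2^{-s}$ almost everywhere. If the sector term $8 \cdot 10^s \varphi^{s-1}$ from (5.11) is smaller than $2^{-s}$, then the remaining upper limit in the final inequality must be positive, so (5.9) fails for that $\varphi$. The Python sketch below (function names are ours) computes the angle at which the two quantities are equal. It is meant only to indicate the size of the constants involved and is not part of the proof.

```python
def sector_term(s, phi):
    """The bound 8 * 10^s * phi^(s-1) from (5.11)."""
    return 8.0 * 10.0 ** s * phi ** (s - 1.0)

def critical_angle(s):
    """Angle phi at which 8 * 10^s * phi^(s-1) equals 2^(-s), for 1 < s < 2.

    Solving 8 * 10^s * phi^(s-1) = 2^(-s) gives phi = (8 * 20^s)^(-1/(s-1)).
    Any smaller positive phi makes the sector term strictly less than 2^(-s).
    """
    if not 1.0 < s < 2.0:
        raise ValueError("s must satisfy 1 < s < 2")
    return (8.0 * 20.0 ** s) ** (-1.0 / (s - 1.0))

if __name__ == "__main__":
    for s in (1.1, 1.5, 1.9):
        print(s, critical_angle(s))
```

The admissible angles shrink very rapidly as $s$ decreases towards 1, which reflects the exponent $s - 1$ in (5.11).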

The results of this chapter begin to provide a local picture of fractals that 
are s-sets. By taking these methods rather further, it is possible to obtain much 
more precise estimates of densities and also of the directional distributions of 
s-sets about typical points. For example, it may be shown that if s > 1, almost 
every line through $\mathcal{H}^s$-almost every point of an $s$-set $F$ intersects $F$ in a set of
dimension $s - 1$.

Recently, packing measures (see Section 3.4) have been employed in the study 
of local properties, and it has been shown that regularity of a set corresponds 
closely to the equality of its packing measure and (slightly modified) Haus- 
dorff measure. 

These ideas extend, albeit with considerable effort, to higher dimensions. Regular
$s$-sets in $\mathbb{R}^n$ may be defined using densities and, again, $s$-sets can only be
regular if $s$ is an integer. Regular $s$-sets have tangents almost everywhere, and are
'$s$-dimensional-surface-like' in the sense that, except for a subset of $\mathcal{H}^s$-measure
zero, they may be covered by a countable collection of Lipschitz images of
subsets of $\mathbb{R}^s$.



5.4 Notes and references 

This chapter touches the surface of a deep area of mathematics known as geomet- 
ric measure theory. It has its origins in the fundamental papers of Besicovitch 
(1928, 1938) which contain a remarkably complete analysis of 1-sets in the 
plane. The results on $s$-sets in the plane for non-integral $s$ are due to Marstrand
(1954a). A succession of writers have extended this work to subsets of higher- 
dimensional space, culminating in the paper of Preiss (1987) which solved many 
of the outstanding problems. More detailed discussions of regular and irregu- 
lar sets may be found in Falconer (1985a), Mattila (1995) and Federer (1996). 
Modern approaches use tangent measures, introduced by Preiss (1987), which 
are described in Mattila (1995) and Falconer (1997). It is widely believed that 
the '$\frac{3}{4}$' in Proposition 5.8 can be replaced by '$\frac{1}{2}$'. Farag (2002) has shown this
in many cases, and Preiss and Tišer (1992) obtain $(2 + \sqrt{46})/12 = 0.732\ldots$ in
general.



Exercises 

5.1 By applying Proposition 5.1 with s = n = 2, deduce the Lebesgue density theo- 
rem (5.1). 

5.2 Let $f : \mathbb{R} \to \mathbb{R}$ be a continuously differentiable function such that $0 < c_1 \le f'(x) \le
c_2$ for all $x$. Show that, if $F$ is an $s$-set in $\mathbb{R}$, then $\underline{D}^s(f(F), f(x)) = \underline{D}^s(F, x)$ for
all $x$ in $\mathbb{R}$, with a similar result for upper densities.

5.3 Let $F$ be the middle third Cantor set. Show that $\underline{D}^s(F, x) \le 2^{-s}$ for all $x$, where
$s = \log 2/\log 3$. Deduce that $F$ is irregular.

5.4 Estimate the upper and lower densities at points of the 1-set depicted in figure 0.4
and show that it is irregular.

5.5 Adapt the proof of Theorem 5.2 to show that if $F$ is an $s$-set with $0 < s < 1$, then
$\underline{D}^s(F, x) \le (1 + 2^{s/(s-1)})^{s-1}$ for almost all $x$.

5.6 Construct a regular 1-set that is totally disconnected. (Hint: start with an interval.)

5.7 Let $E$ and $F$ be $s$-sets in $\mathbb{R}^2$ such that for every disc $B(x, r)$ we have that
$\mathcal{H}^s(B(x, r) \cap E) \le \mathcal{H}^s(B(x, r) \cap F)$. Show that $\mathcal{H}^s(E \setminus F) = 0$. Need we have
$E \subset F$?

5.8 Let $F_1, F_2, \ldots$ be 1-sets in the plane such that $F = \bigcup_{k=1}^{\infty} F_k$ is a 1-set. Show that
if $F_k$ is regular for all $k$ then $F$ is regular, and if $F_k$ is irregular for all $k$ then $F$
is irregular.

5.9 Show that if $E$ is a regular 1-set and $F$ an irregular 1-set then $\mathcal{H}^1(E \cap F) = 0$.



Chapter 6 Projections of fractals 



In this chapter we consider the orthogonal projection or 'shadow' of fractals in $\mathbb{R}^n$
onto lower-dimensional subspaces. A smooth (1-dimensional) curve in $\mathbb{R}^3$ generally
has a (1-dimensional) curve as its shadow on a plane, but a (2-dimensional)
surface or (3-dimensional) solid object generally has a 2-dimensional shadow, as
in the upper part of figure 6.1. We examine analogues of this for fractals. Intuitively,
one would expect a set $F$ in $\mathbb{R}^3$ to have plane projections of dimension
2 if $\dim_H F > 2$ and of dimension $\dim_H F$ if $\dim_H F < 2$, as in the lower part
of figure 6.1. Roughly speaking this is correct, but a precise formulation of the 
projection properties requires some care. 

We prove the projection theorems in the simplest case, for projection of subsets 
of the plane onto lines, and then state the higher-dimensional analogues. 

6.1 Projections of arbitrary sets 

Let $L_\theta$ be the line through the origin of $\mathbb{R}^2$ that makes an angle $\theta$ with the
horizontal axis. We denote orthogonal projection onto $L_\theta$ by $\mathrm{proj}_\theta$, so that if $F$ is
a subset of $\mathbb{R}^2$, then $\mathrm{proj}_\theta F$ is the projection of $F$ onto $L_\theta$; see figure 6.2. Clearly,
$|\mathrm{proj}_\theta x - \mathrm{proj}_\theta y| \le |x - y|$ if $x, y \in \mathbb{R}^2$, i.e. $\mathrm{proj}_\theta$ is a Lipschitz mapping. Thus

$$\dim_H(\mathrm{proj}_\theta F) \le \min\{\dim_H F, 1\} \tag{6.1}$$

for any $F$ and $\theta$, by Corollary 2.4(a). (As $\mathrm{proj}_\theta F$ is a subset of the line $L_\theta$,
its dimension cannot be more than 1.) The interesting question is whether the
opposite inequality is valid. The projection theorems tell us that this is so for
almost all $\theta \in [0, \pi)$; that is, the exceptional values of $\theta$ for which inequality
(6.1) is strict form a set of zero length (1-dimensional Lebesgue measure).

Projection theorem 6.1 

Let $F \subset \mathbb{R}^2$ be a Borel set.

(a) If $\dim_H F \le 1$ then $\dim_H(\mathrm{proj}_\theta F) = \dim_H F$ for almost all $\theta \in [0, \pi)$.

(b) If $\dim_H F > 1$ then $\mathrm{proj}_\theta F$ has positive length (as a subset of $L_\theta$) and so
has dimension 1 for almost all $\theta \in [0, \pi)$.
Figure 6.1 Top: projections of classical sets onto a plane. A curve 'typically' has
projection of dimension 1, but the surface and cube have projections of dimension 2 and
of positive area. Bottom: projections of fractal sets onto a plane. If $\dim_H F_1 < 2$ and
$\dim_H F_2 > 2$ then 'typically' the projection of $F_1$ has dimension equal to $\dim_H F_1$ (and
zero area) and the projection of $F_2$ has dimension 2 and positive area




Figure 6.2 Projection of a set $F$ onto a line $L_\theta$
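Proof. We give a proof that uses the potential theoretic characterization of Hausdorff
dimension in a very effective way. If $s < \dim_H F \le 1$ then by Theorem 4.13(b)
there exists a mass distribution $\mu$ on (a compact subset of) $F$ with
$0 < \mu(F) < \infty$ and

$$\int_F \int_F \frac{d\mu(x)\,d\mu(y)}{|x - y|^s} < \infty. \tag{6.2}$$

For each $\theta$ we 'project' the mass distribution $\mu$ onto the line $L_\theta$ to get a mass
distribution $\mu_\theta$ on $\mathrm{proj}_\theta F$. Thus $\mu_\theta$ is defined by the requirement that

$$\mu_\theta([a, b]) = \mu\{x : a \le x \cdot \theta \le b\}$$

for each interval $[a, b]$, or equivalently,

$$\int_{-\infty}^{\infty} f(t)\,d\mu_\theta(t) = \int_F f(x \cdot \theta)\,d\mu(x)$$

for each non-negative function $f$. (Here $\theta$ is the unit vector in the direction $\theta$,
with $x$ identified with its position vector and $x \cdot \theta$ the usual scalar product.) Then

$$\int_0^{\pi} \left[ \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} \frac{d\mu_\theta(u)\,d\mu_\theta(v)}{|u - v|^s} \right] d\theta
= \int_0^{\pi} \left[ \int_F \int_F \frac{d\mu(x)\,d\mu(y)}{|x \cdot \theta - y \cdot \theta|^s} \right] d\theta$$
$$= \int_0^{\pi} \left[ \int_F \int_F \frac{d\mu(x)\,d\mu(y)}{|(x - y) \cdot \theta|^s} \right] d\theta
= \int_0^{\pi} \frac{d\theta}{|\tau \cdot \theta|^s} \int_F \int_F \frac{d\mu(x)\,d\mu(y)}{|x - y|^s} \tag{6.3}$$

for any fixed unit vector $\tau$. (Note that the integral of $|(x - y) \cdot \theta|^{-s}$ with respect
to $\theta$ depends only on $|x - y|$.) If $s < 1$ then (6.3) is finite by virtue of (6.2)
and that

$$\int_0^{\pi} \frac{d\theta}{|\tau \cdot \theta|^s} = \int_0^{\pi} \frac{d\theta}{|\cos(\tau - \theta)|^s} < \infty.$$

Hence

$$\int_{-\infty}^{\infty} \int_{-\infty}^{\infty} \frac{d\mu_\theta(u)\,d\mu_\theta(v)}{|u - v|^s} < \infty$$

for almost all $\theta \in [0, \pi)$. By Theorem 4.13(a) the existence of such a mass
distribution $\mu_\theta$ on $\mathrm{proj}_\theta F$ implies that $\dim_H(\mathrm{proj}_\theta F) \ge s$. This is true for all
$s < \dim_H F$, so part (a) of the result follows.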

The proof of (b) follows similar lines, though Fourier transforms need to be 
introduced to show that the projections have positive length. □ 
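
The step in the proof of (a) that depends on $s < 1$ is the finiteness of the angular integral $\int_0^{\pi} |\cos(\tau - \theta)|^{-s}\,d\theta$, which by periodicity equals $\int_0^{\pi} |\cos\theta|^{-s}\,d\theta$. A standard Beta-function evaluation gives the value $\sqrt{\pi}\,\Gamma((1-s)/2)/\Gamma(1 - s/2)$ for $s < 1$. The Python sketch below (function names and the choice of quadrature are ours) compares this closed form with a simple midpoint rule, applied after a substitution that removes the singularity at $\theta = \frac{1}{2}\pi$. It is a numerical illustration only.

```python
import math

def angular_integral_closed_form(s):
    """Integral of |cos(theta)|^(-s) over [0, pi] for s < 1, via the Beta function."""
    return math.sqrt(math.pi) * math.gamma((1.0 - s) / 2.0) / math.gamma(1.0 - s / 2.0)

def angular_integral_numeric(s, n=20000):
    """Midpoint rule for the same integral, for 0 <= s < 1.

    By symmetry the integral is twice the integral of sin(t)^(-s) over [0, pi/2].
    Substituting t = u^p with p = 1/(1-s) turns the integrand into
    p * (t / sin t)^s, which is bounded near u = 0.
    """
    p = 1.0 / (1.0 - s)
    upper = (math.pi / 2.0) ** (1.0 - s)
    h = upper / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * h
        t = u ** p
        total += p * (t / math.sin(t)) ** s
    return 2.0 * total * h

if __name__ == "__main__":
    for s in (0.0, 0.5, 0.9):
        print(s, angular_integral_closed_form(s), angular_integral_numeric(s))
```

The values grow without bound as $s$ approaches 1, which is why the energy argument above is restricted to $s < 1$.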

These projection theorems generalize to higher dimensions in the natural way. 
Let $G_{n,k}$ be the set of $k$-dimensional subspaces or '$k$-planes through the origin' in
$\mathbb{R}^n$. These subspaces are naturally parametrized by $k(n - k)$ coordinates ('generalized
direction cosines') so that we may refer to 'almost all' subspaces in a
consistent way in terms of $k(n - k)$-dimensional Lebesgue measure. We write
$\mathrm{proj}_\Pi$ for orthogonal projection onto the $k$-plane $\Pi$.



Theorem 6.2. Higher-dimensional projection theorem 

Let $F \subset \mathbb{R}^n$ be a Borel set.

(a) If $\dim_H F \le k$ then $\dim_H(\mathrm{proj}_\Pi F) = \dim_H F$ for almost all $\Pi \in G_{n,k}$.

(b) If $\dim_H F > k$ then $\mathrm{proj}_\Pi F$ has positive $k$-dimensional measure and so has
dimension $k$ for almost all $\Pi \in G_{n,k}$.

Proof. The proof of Theorem 6.1 extends to higher dimensions without diffi- 
culty. □ 

Thus if $F$ is a subset of $\mathbb{R}^3$, the plane projections of $F$ are, in general, of
dimension $\min\{2, \dim_H F\}$. This result has important practical implications. We
can estimate the dimension of an object in space by estimating the dimension 
of a photograph taken from a random direction. Provided this is less than 2, 
it may be assumed to equal the dimension of the object. Such a reduction can 
make dimension estimates of spatial objects tractable — box-counting methods 
are difficult to apply in 3 dimensions but can be applied with reasonable success 
in the plane. 



6.2 Projections of s-sets of integral dimension 

If a subset $F$ of $\mathbb{R}^2$ has Hausdorff dimension exactly 1, then Theorem 6.1 tells us
that the projections of $F$ onto almost every $L_\theta$ have dimension 1. However, in this
critical case, no information is given as to whether these projections have zero or
positive length. In the special case where $F$ is a 1-set, i.e. with $0 < \mathcal{H}^1(F) < \infty$,
an analysis is possible. Recall from Theorem 5.3 that a 1-set may be decomposed
into a regular curve-like part and an irregular dust-like part. The following two 
theorems provide another sharp contrast between these types of set. 

Theorem 6.3 

Let $F$ be a regular 1-set in $\mathbb{R}^2$. Then $\mathrm{proj}_\theta F$ has positive length except for at
most one $\theta \in [0, \pi)$.

Sketch of proof. By Theorem 5.9(b) it is enough to prove the result if F is a subset 
of positive length of a rectifiable curve C. Using the Lebesgue density theorem 
to approximate to such an F by short continuous subcurves of C, essentially all 
we need to consider is the case when $F$ is itself a rectifiable curve $C_1$ joining
distinct points $x$ and $y$. But clearly, the projection onto $L_\theta$ of such a curve is an
interval of positive length, except possibly for the one value of $\theta$ for which $L_\theta$
is perpendicular to the straight line through $x$ and $y$. □

(In general $\mathrm{proj}_\theta F$ will have positive length for all $\theta$; there is an exceptional
value of $\theta$ only if $F$ is contained in a set of parallel line segments.)

Theorem 6.4 

Let $F$ be an irregular 1-set in $\mathbb{R}^2$. Then $\mathrm{proj}_\theta F$ has length zero for almost all
$\theta \in [0, \pi)$.

Proof. The proof is complicated, depending on the intricate density and angular 
density structure of irregular sets. We omit it! □ 

These theorems may be combined in several ways. 

Corollary 6.5 

Let $F$ be a 1-set in $\mathbb{R}^2$. If the regular part of $F$ has $\mathcal{H}^1$-measure zero, then $\mathrm{proj}_\theta F$
has length zero for almost all $\theta$; otherwise it has positive length for all but at
most one value of $\theta$.

The following characterization of irregular sets is also useful. 

Corollary 6.6 

A 1-set in $\mathbb{R}^2$ is irregular if and only if it has projections of zero length in at least
two directions.



Example 6.7 

The Cantor dust $F$ of figure 0.4 is an irregular 1-set.

Calculation. In Example 2.6 we showed that $F$ is a 1-set. It is easy to see that the
projections of $F$ onto lines $L_\theta$ with $\tan\theta = \frac{1}{2}$ and $\tan\theta = -2$ have zero length
(look at the first few iterations), so $F$ is irregular by Corollary 6.6. □
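
The phrase 'look at the first few iterations' can be made concrete with a short computation. The Python sketch below builds the level-$k$ approximation to a Cantor dust in which each square is replaced by four squares of a quarter of the side, and measures the length of its projection in a given direction. The grid positions in `OFFSETS` are an assumption on our part: figure 0.4 is not reproduced here, and the layout was chosen because it gives the projection behaviour stated in the calculation. All function names are ours.

```python
import math

# Assumed layout: (column, row) of the 4 retained squares in a 4 x 4 grid.
OFFSETS = [(0, 2), (1, 0), (2, 3), (3, 1)]

def squares(level):
    """Lower-left corners and common side length of the 4**level squares."""
    corners, side = [(0.0, 0.0)], 1.0
    for _ in range(level):
        side /= 4.0
        corners = [(x + a * side, y + b * side)
                   for (x, y) in corners for (a, b) in OFFSETS]
    return corners, side

def projection_length(level, theta):
    """Length of the projection of the level-th approximation onto L_theta."""
    c, s = math.cos(theta), math.sin(theta)
    corners, side = squares(level)
    intervals = []
    for (x, y) in corners:
        vals = [(x + dx) * c + (y + dy) * s for dx in (0.0, side) for dy in (0.0, side)]
        intervals.append((min(vals), max(vals)))
    intervals.sort()
    # Merge overlapping or touching intervals and add up the lengths.
    total, (lo, hi) = 0.0, intervals[0]
    for a, b in intervals[1:]:
        if a > hi + 1e-12:
            total += hi - lo
            lo, hi = a, b
        else:
            hi = max(hi, b)
    return total + (hi - lo)

if __name__ == "__main__":
    for k in range(6):
        print(k, projection_length(k, math.atan(0.5)),
              projection_length(k, math.atan(-2.0)),
              projection_length(k, 0.0))
```

With this layout the projected squares coincide in pairs in the directions $\tan\theta = \frac{1}{2}$ and $\tan\theta = -2$, so the projected length halves at each level, whereas the projection onto the horizontal axis remains the whole unit interval.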

The results of this section have been stated for sets for which $0 < \mathcal{H}^1(F) < \infty$,
which is rather a strong property for 1-dimensional sets to have, although one
which occurs surprisingly often. However, the theorems can be applied rather
more widely. If $F$ is any set that intersects some rectifiable curve in a set of
positive length, so that $F$ contains a regular subset, then $\mathrm{proj}_\theta F$ has positive
length for almost all $\theta$. Again, if $F$ is a $\sigma$-finite irregular set, i.e. one which may
be expressed as a countable union of irregular 1-sets each of finite measure, then
$\mathrm{proj}_\theta F$ has zero length for almost all $\theta$; this follows by taking countable unions
of the projections of these component 1-sets. 
For the record, we state the higher-dimensional analogue of Theorems 6.3 and
6.4, though the proofs are even more complicated than in the plane case. 

Theorem 6.8 

Let $F$ be a $k$-set in $\mathbb{R}^n$, where $k$ is an integer.

(a) If $F$ is regular then $\mathrm{proj}_\Pi F$ has positive $k$-dimensional measure for almost
all $\Pi \in G_{n,k}$.

(b) If $F$ is irregular then $\mathrm{proj}_\Pi F$ has zero $k$-dimensional measure for almost
all $\Pi \in G_{n,k}$.



6.3 Projections of arbitrary sets of integral dimension 

The theorems of the last section, although mathematically elegant and sophisti- 
cated, do not provide a complete answer to the question of whether projections 
of plane sets onto lines have zero or positive length. A subset $F$ of $\mathbb{R}^2$ of Hausdorff
dimension 1 need not be a 1-set or even be of $\sigma$-finite $\mathcal{H}^1$-measure (i.e. a
countable union of sets of finite $\mathcal{H}^1$-measure). Moreover there need not be any
dimension function $h$ (see Section 2.5) for which $0 < \mathcal{H}^h(F) < \infty$, in which
case mathematical analysis is extremely difficult. What can be said about the
projections of such sets? The surprising answer is that, by working in this rather
delicate zone of sets of Hausdorff dimension 1 but of non-$\sigma$-finite $\mathcal{H}^1$-measure,
we can construct sets with projections more or less what we please. For example,
there is a set $F$ in $\mathbb{R}^2$ such that $\mathrm{proj}_\theta F$ contains an interval of length 1 for almost
all $\theta$ with $0 \le \theta < \frac{1}{2}\pi$ but with $\mathrm{proj}_\theta F$ of length zero for $\frac{1}{2}\pi \le \theta < \pi$. More
generally, we have the following result which says that there exist sets for which
the projections in almost all directions are, to within length zero, anything that 
we care to prescribe. The measurability condition in square brackets is included 
for completeness, but is best ignored by non-specialists! 

Theorem 6.9 

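Let $G_\theta$ be a subset of $L_\theta$ for each $\theta \in [0, \pi)$ [such that the set $\bigcup_{0 \le \theta < \pi} G_\theta$ is
plane Lebesgue measurable]. Then there exists a Borel set $F \subset \mathbb{R}^2$ such that

(a) $\mathrm{proj}_\theta F \supset G_\theta$ for all $\theta$, and

(b) length $(\mathrm{proj}_\theta F \setminus G_\theta) = 0$ for almost all $\theta$.

In particular, for almost all $\theta$, the set of points of $L_\theta$ belonging to either $G_\theta$ or
$\mathrm{proj}_\theta F$, but not both, has zero length.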

Figure 6.3 (a) The 'iterated Venetian blind' construction. (b) Projections in certain bands
of directions have large lengths, whilst projections in other bands of directions have very
small lengths

Idea of proof. Without going into much detail, we indicate the basic building block
for such sets, which has been termed the 'iterated Venetian blind' construction.
This is shown in figure 6.3. Let $E$ be a line segment of length $\lambda$. Let $\varepsilon$ be a
small angle and $k$ a large number. We replace $E$ by $k$ line segments of lengths
roughly $\lambda/k$, each at an angle $\varepsilon$ to $E$ and with endpoints equally spaced along $E$
to form a new set, $E_1$. We repeat this process with each segment of $E_1$ to form
a set $E_2$ consisting of $k^2$ line segments all of lengths about $\lambda/k^2$ and at angle
$2\varepsilon$ to $E$. We continue in this way, to get $E_r$, a set of $k^r$ segments all of lengths
about $\lambda/k^r$ and at angle $r\varepsilon$ to $E$. We stop when $r$ is such that $r\varepsilon$ is, say, about
jTC . Comparing the projections of E r with that of the original line segment E, 
we see that if $0 \le \theta < \tfrac{1}{2}\pi$ then $\mathrm{proj}_\theta E$ and $\mathrm{proj}_\theta E_r$ are nearly the same (since
lines perpendicular to Lg that cut E also cut E r ). However, if— ^jt < < 
then $\mathrm{proj}_\theta E_r$ will have very small length, since most lines perpendicular to $L_\theta$
will pass straight between appropriately angled 'slats' of the construction. Thus 
the projections of E r are very similar to those of E in certain directions, but are 
almost negligible in other directions. This idea may be adapted to obtain sets 
with projections very close to Gg in a narrow band of directions but with almost 
null projections in other directions. Taking unions of such sets for various small 
bands of directions gives a set with approximately the required property. Taking 
a limit of a sequence of sets which give increasingly accurate approximations 
leads to a set with the properties stated. □ 



This construction may be extended to higher dimensions: there exists a set $F$
in $\mathbb{R}^n$ such that almost all projections of $F$ onto $k$-dimensional subspaces differ
from prescribed sets by zero $k$-dimensional measure. In particular there exists a
set in 3-dimensional space with almost all of its plane shadows anything we care
to prescribe to within zero area. By specifying the shadows to be the thickened
digits of the time when the sun is shining from a perpendicular direction, we
obtain, at least in theory, a digital sundial; see figure 6.4. As the sun moves
across the sky we get different projections of the set. The notion of a digital
sundial was introduced to provide an intuitive view of this result, rather than as a
feasible method of chronography, but, amazingly, working digital sundials have
recently been manufactured, albeit depending on a different principle!

Figure 6.4 A digital sundial

6.4 Notes and references 

A geometric proof of the projection theorems for arbitrary subsets of the plane
was given by Marstrand (1954a); the potential theoretic proof was due to
Kaufman (1968). Mattila (1975) obtained various generalizations including extensions
to higher dimensions. The projection results for regular and irregular 1-sets in
the plane are, surprisingly, rather older, dating back to Besicovitch (1939), with
the analogous results for $s$-sets in $\mathbb{R}^n$ in the mammoth paper of Federer (1947).
More detailed accounts of the projection theorems are in Falconer (1985a) and
Mattila (1995). Projection results for box and packing dimensions are more
subtle, see Falconer and Howroyd (1997) and Howroyd (2001). A dual version of
Theorem 6.9 was given by Davies (1952) and a direct proof, with the
higher-dimensional generalizations, by Falconer (1986a).

Exercises 

6.1 Let $E = F \times F \subset \mathbb{R}^2$ where $F$ is the middle $\lambda$ Cantor set. ($E$ has Hausdorff
dimension $2 \log 2/\log(2/(1 - \lambda))$, see Examples 4.5 and 7.6.) What is $\dim_H \mathrm{proj}_\theta E$
(a) for a typical $\theta$ and (b) for $\theta = 0$ and $\theta = \pi/2$?

6.2 Let $E$ be the 'circular Cantor set' given in complex number form by
$E = \{e^{2\pi i \varphi} : \varphi \in F\}$ where $F$ is the middle third Cantor set. What is
$\dim_H \mathrm{proj}_\theta E$ for each $\theta$?

6.3 For $0 < s < 1$, give an example of an $s$-set $F$ in $\mathbb{R}^2$ such that $\mathrm{proj}_\theta F$ is an $s$-set for
all $\theta$.

6.4 Let $E$ and $F$ be subsets of $\mathbb{R}$ with Hausdorff dimension strictly between 0 and
1. You are given that the subset $E \times F$ of $\mathbb{R}^2$ has Hausdorff dimension at least
$\dim_H E + \dim_H F$ (see Chapter 7). Show that the projections of $E \times F$ onto the
coordinate axes are always 'exceptional' as far as Projection theorem 6.1 is concerned.

6.5 Show that $\dim_H \mathrm{proj}_\theta F \ge \dim_H F - 1$ for all $F \subset \mathbb{R}^2$ and all $\theta$.

6.6 Let F be an irregular 1-set in the plane. Deduce from Theorem 6.4 that F is totally 
disconnected. 

6.7 Let $F$ be a connected subset of $\mathbb{R}^2$ containing more than one point. Show that
$\mathrm{proj}_\theta F$ has positive length for all except possibly one value of $\theta$. (Thus the
projection theorems in the plane are only really of interest for sets that are not
connected.)

6.8 Let $E$ and $F$ be subsets of $\mathbb{R}$. Show that, for almost all real numbers
$\lambda$, $\dim_H(E + \lambda F) = \min\{1, \dim_H(E \times F)\}$, where $E + \lambda F$ denotes the set of real
numbers $\{x + \lambda y : x \in E, y \in F\}$.

6.9 Show that the conclusion of Theorem 6.4 remains true if F is a countable union of 
irregular 1-sets. 

6.10 Let $E$ and $F$ be any subsets of $\mathbb{R}$ of length (1-dimensional Lebesgue measure)
0. Show that any rectifiable curve in $\mathbb{R}^2$ intersects the product $E \times F$ in a set of
length 0.

6.11 If $F$ is a set and $x$ is a point in $\mathbb{R}^2$, the projection of $F$ at $x$, denoted by $\mathrm{proj}_x F$, is
defined as the set of $\theta$ in $[0, 2\pi)$ such that the half-line emanating from $x$ in direction
$\theta$ intersects $F$. Let $L$ be a line. Show that if $\dim_H F \le 1$ then $\dim_H \mathrm{proj}_x F = \dim_H F$
for almost all $x$ on $L$ (in the sense of Lebesgue measure) and if $\dim_H F > 1$ then
$\mathrm{proj}_x F$ has positive length for almost all $x$ on $L$. (Hint: consider a sphere tangential
to the plane and a transformation that maps a point $x$ on the plane to the point on
the sphere intersected by the line joining $x$ to the centre of the sphere.)

6.12 Let $F \subset \mathbb{R}^2$. Show that for all $\theta$, $\underline{\dim}_B \mathrm{proj}_\theta F \le \underline{\dim}_B F$,
$\overline{\dim}_B \mathrm{proj}_\theta F \le \overline{\dim}_B F$ and $\dim_P \mathrm{proj}_\theta F \le \dim_P F$. Show that
$\dim_P \mathrm{proj}_\theta F = \dim_P F$ for almost all $\theta$ if $\dim_P F = \dim_H F$.



Chapter 7 Products of fractals 



One way of constructing 'new fractals from old' is by forming Cartesian products. 
Indeed, many fractals that occur in practice, for example as attractors of certain 
dynamical systems, are products or, at least, are locally product-like. In this 
chapter we develop dimension formulae for products. 



7.1 Product formulae 

Recall that if $E$ is a subset of $\mathbb{R}^n$ and $F$ is a subset of $\mathbb{R}^m$, the Cartesian product,
or just product, $E \times F$ is defined as the set of points with first coordinate in $E$
and second coordinate in $F$, i.e.

$$E \times F = \{(x, y) \in \mathbb{R}^{n+m} : x \in E,\ y \in F\}. \qquad (7.1)$$

Thus if $E$ is a unit interval in $\mathbb{R}$, and $F$ is a unit interval in $\mathbb{R}^2$, then $E \times F$ is
a unit square in $\mathbb{R}^3$ (figure 7.1). Again, if $F$ is the middle third Cantor set, then
$F \times F$ is the 'Cantor product' (figure 7.2) consisting of those points in the plane
with both coordinates in $F$.

In the first example above it is obvious that 

$$\dim(E \times F) = \dim E + \dim F$$

using the classical definition of dimension. This holds more generally, in the
'smooth' situation, where $E$ and $F$ are smooth curves, surfaces or higher-dimensional
manifolds. Unfortunately, this equation is not always valid for 'fractal'
dimensions. For Hausdorff dimensions the best general result possible is an
inequality $\dim_H(E \times F) \ge \dim_H E + \dim_H F$. Nevertheless, as we shall see, in
many situations equality does hold.

The proof of the product rule uses the Hausdorff measures on $E$ and $F$ to
define a mass distribution $\mu$ on $E \times F$. Density bounds on $E$ and $F$ lead to
estimates for $\mu$ suitable for a mass distribution method.

Figure 7.1 The Cartesian product of a unit interval in $\mathbb{R}$ and a unit interval in $\mathbb{R}^2$

Proposition 7.1 

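If $E \subset \mathbb{R}^n$, $F \subset \mathbb{R}^m$ are Borel sets with $\mathcal{H}^s(E), \mathcal{H}^t(F) < \infty$, then

$$\mathcal{H}^{s+t}(E \times F) \ge c\,\mathcal{H}^s(E)\,\mathcal{H}^t(F) \qquad (7.2)$$

where $c > 0$ depends only on $s$ and $t$.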

Proof. For simplicity we assume that $E, F \subset \mathbb{R}$, so that $E \times F \subset \mathbb{R}^2$; the general
proof is almost identical. If either $\mathcal{H}^s(E)$ or $\mathcal{H}^t(F)$ is zero, then (7.2) is trivial,
so suppose that $E$ is an $s$-set and $F$ is a $t$-set, i.e. $0 < \mathcal{H}^s(E), \mathcal{H}^t(F) < \infty$. We
may define a mass distribution $\mu$ on $E \times F$ by utilizing the 'product measure'
of $\mathcal{H}^s$ and $\mathcal{H}^t$. Thus if $I, J \subset \mathbb{R}$, we define $\mu$ on the 'rectangle' $I \times J$ by

$$\mu(I \times J) = \mathcal{H}^s(E \cap I)\,\mathcal{H}^t(F \cap J). \qquad (7.3)$$

It may be shown that this defines a mass distribution $\mu$ on $E \times F$ with
$\mu(\mathbb{R}^2) = \mathcal{H}^s(E)\,\mathcal{H}^t(F)$.

By the density estimate Proposition 5.1(b) we have that

$$\limsup_{r \to 0} \mathcal{H}^s(E \cap B(x, r))(2r)^{-s} \le 1 \qquad (7.4)$$

for $\mathcal{H}^s$-almost all $x \in E$ and

$$\limsup_{r \to 0} \mathcal{H}^t(F \cap B(y, r))(2r)^{-t} \le 1 \qquad (7.5)$$

for $\mathcal{H}^t$-almost all $y \in F$. (Of course, since we are concerned with subsets of $\mathbb{R}$,
the 'ball' $B(x, r)$ is just the interval of length $2r$ with midpoint $x$.) From the
definition of $\mu$, both (7.4) and (7.5) hold for $\mu$-almost all $(x, y)$ in $E \times F$. Since
the disc $B((x, y), r)$ is contained in the square $B(x, r) \times B(y, r)$ we have that

$$\mu(B((x, y), r)) \le \mu(B(x, r) \times B(y, r)) = \mathcal{H}^s(E \cap B(x, r))\,\mathcal{H}^t(F \cap B(y, r))$$

so

$$\frac{\mu(B((x, y), r))}{(2r)^{s+t}} \le \frac{\mathcal{H}^s(E \cap B(x, r))}{(2r)^s} \cdot \frac{\mathcal{H}^t(F \cap B(y, r))}{(2r)^t}.$$

It follows, using (7.4) and (7.5), that $\limsup_{r \to 0} \mu(B((x, y), r))(2r)^{-(s+t)} \le 1$ for
$\mu$-almost all $(x, y) \in E \times F$. By Proposition 4.9(a)

$$\mathcal{H}^{s+t}(E \times F) \ge 2^{-(s+t)} \mu(E \times F) = 2^{-(s+t)}\,\mathcal{H}^s(E)\,\mathcal{H}^t(F). \qquad \Box$$

Figure 7.2 The product $F \times F$, where $F$ is the middle third Cantor set. In this case,
$\dim_H F \times F = 2 \dim_H F = 2 \log 2/\log 3$



Product formula 7.2 

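If $E \subset \mathbb{R}^n$, $F \subset \mathbb{R}^m$ are Borel sets then

$$\dim_H(E \times F) \ge \dim_H E + \dim_H F. \qquad (7.6)$$

Proof. If $s$, $t$ are any numbers with $s < \dim_H E$ and $t < \dim_H F$, then
$\mathcal{H}^s(E) = \mathcal{H}^t(F) = \infty$. Theorem 4.10 implies that there are Borel sets $E_0 \subset E$ and
$F_0 \subset F$ with $0 < \mathcal{H}^s(E_0), \mathcal{H}^t(F_0) < \infty$. By Proposition 7.1
$\mathcal{H}^{s+t}(E \times F) \ge \mathcal{H}^{s+t}(E_0 \times F_0) \ge c\,\mathcal{H}^s(E_0)\,\mathcal{H}^t(F_0) > 0$. Hence
$\dim_H(E \times F) \ge s + t$. By choosing $s$ and $t$ arbitrarily close to $\dim_H E$ and
$\dim_H F$, (7.6) follows. □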

Proposition 7.1 and Formula 7.2 are in fact valid for arbitrary (non-Borel) sets.

It follows immediately from (7.6) that the 'Cantor product' $F \times F$, where $F$ is
the middle third Cantor set, has Hausdorff dimension at least $2 \log 2/\log 3$ (see
figure 7.2).

In general, inequality (7.6) cannot be reversed; see Example 7.8. However, if, 
as often happens, either E or F is 'reasonably regular' in the sense of having 
equal Hausdorff and upper box dimensions, then we do get equality. 

Product formula 7.3 

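For any sets $E \subset \mathbb{R}^n$ and $F \subset \mathbb{R}^m$

$$\dim_H(E \times F) \le \dim_H E + \overline{\dim}_B F. \qquad (7.7)$$

Proof. For simplicity take $E \subset \mathbb{R}$ and $F \subset \mathbb{R}$. Choose numbers $s > \dim_H E$ and
$t > \overline{\dim}_B F$. Then there is a number $\delta_0 > 0$ such that $F$ may be covered by
$N_\delta(F) \le \delta^{-t}$ intervals of length $\delta$ for all $\delta \le \delta_0$. Let $\{U_i\}$ be any $\delta$-cover of $E$
by intervals with $\sum_i |U_i|^s < 1$. For each $i$, let $\{U_{i,j}\}$ be a cover of $F$ by $N_{|U_i|}(F)$
intervals of length $|U_i|$. Then $U_i \times F$ is covered by $N_{|U_i|}(F)$ squares $\{U_i \times U_{i,j}\}$
of side $|U_i|$. Thus $E \times F \subset \bigcup_i \bigcup_j (U_i \times U_{i,j})$, so that

$$\mathcal{H}^{s+t}_{\delta\sqrt{2}}(E \times F) \le \sum_i \sum_j |U_i \times U_{i,j}|^{s+t} \le \sum_i N_{|U_i|}(F)\, 2^{(s+t)/2} |U_i|^{s+t}$$

$$\le 2^{(s+t)/2} \sum_i |U_i|^{-t} |U_i|^{s+t} < 2^{(s+t)/2}.$$

Letting $\delta \to 0$ gives $\mathcal{H}^{s+t}(E \times F) < \infty$ whenever $s > \dim_H E$ and $t > \overline{\dim}_B F$,
so $\dim_H(E \times F) \le s + t$. □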

Corollary 7.4 

If $\dim_H F = \overline{\dim}_B F$ then

$$\dim_H(E \times F) = \dim_H E + \dim_H F.$$

Proof. Note that combining Product formulae 7.2 and 7.3 gives

$$\dim_H E + \dim_H F \le \dim_H(E \times F) \le \dim_H E + \overline{\dim}_B F. \qquad (7.8)$$

□

It is worth noting that the basic product inequality for upper box dimensions 
is opposite to that for Hausdorff dimensions. 

Product formula 7.5 

For any sets $E \subset \mathbb{R}^n$ and $F \subset \mathbb{R}^m$

$$\overline{\dim}_B(E \times F) \le \overline{\dim}_B E + \overline{\dim}_B F. \qquad (7.9)$$

Proof. This is left as Exercise 7.5. The idea is just as in Formula 7.3: note that if
$E$ and $F$ can be covered by $N_\delta(E)$ and $N_\delta(F)$ intervals of side $\delta$, then $E \times F$ is
covered by the $N_\delta(E) N_\delta(F)$ squares formed by products of these intervals. □
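The covering idea in this proof can be checked numerically on the 'Cantor product'. The sketch below is an illustration only: the helper names `cantor_intervals` and `box_count_estimates` are hypothetical, and the counts are the obvious covers by level-$k$ intervals (and their products) at scale $\delta = 3^{-k}$, not minimal covers.

```python
from math import log

def cantor_intervals(level):
    """Left endpoints, in units of 3**-level, of the 2**level intervals
    making up stage `level` of the middle third Cantor set."""
    lefts = [0]
    for _ in range(level):
        # an interval [a, a+1] at the coarser scale keeps its two outer thirds
        lefts = [3 * a + d for a in lefts for d in (0, 2)]
    return lefts

def box_count_estimates(level):
    """Sizes of the natural covers of F and of F x F at scale 3**-level."""
    lefts = cantor_intervals(level)
    squares = {(a, b) for a in lefts for b in lefts}  # products of intervals
    return len(lefts), len(squares)

for k in (2, 4, 6):
    n_line, n_plane = box_count_estimates(k)
    delta = 3.0 ** -k
    # log N / -log delta for F and for F x F; the second is twice the first
    print(k, n_line, n_plane, log(n_line) / -log(delta), log(n_plane) / -log(delta))
```

The product cover has exactly $N_\delta(F)^2$ squares, so the estimate for $F \times F$ is twice that for $F$, in line with (7.9).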



Example 7.6. Product with uniform Cantor sets 

Let $E$, $F$ be subsets of $\mathbb{R}$ with $F$ a uniform Cantor set (see Example 4.5). Then
$\dim_H(E \times F) = \dim_H E + \dim_H F$.

Calculation. Example 4.5 shows that uniform Cantor sets have equal Hausdorff
and upper box dimensions, so the result follows from Corollary 7.4. □

Thus the 'Cantor product' of the middle third Cantor set with itself has Hausdorff
and box dimensions exactly $2 \log 2/\log 3$. Similarly, if $E$ is a subset of $\mathbb{R}$
and $F$ is a straight line segment, then $\dim_H(E \times F) = \dim_H E + 1$.

Many fractals encountered in practice are not actually products, but are locally 
product-like. For example, the Henon attractor (see (13.5)) looks locally like a 
product of a line segment and a Cantor-like set F. More precisely, there are 
smooth bijections from [0, 1] x F to small neighbourhoods of the attractor. Such 
sets may be analysed as the image of a product under a suitable Lipschitz trans- 
formation. 



Example 7.7 

The 'Cantor target' is the plane set given in polar coordinates by
$F' = \{(r, \theta) : r \in F, 0 \le \theta \le 2\pi\}$ where $F$ is the middle third Cantor set; see figure 7.3.
Then $\dim_H F' = 1 + \log 2/\log 3$.

Calculation. Let $f : \mathbb{R}^2 \to \mathbb{R}^2$ be given by $f(x, y) = (x \cos y, x \sin y)$. It is easy
to see that $f$ is a Lipschitz mapping and $F' = f(F \times [0, 2\pi])$. Thus

$$\dim_H F' = \dim_H f(F \times [0, 2\pi]) \le \dim_H(F \times [0, 2\pi])$$
$$= \dim_H F + \dim_H [0, 2\pi] = (\log 2/\log 3) + 1$$

by Corollary 2.4(a) and Example 7.6. On the other hand, if we restrict $f$ to
$[\tfrac{2}{3}, 1] \times [0, \pi]$ then $f$ is a bi-Lipschitz function on this domain. Since
$F' \supset f((F \cap [\tfrac{2}{3}, 1]) \times [0, \pi])$ we have

$$\dim_H F' \ge \dim_H f((F \cap [\tfrac{2}{3}, 1]) \times [0, \pi])$$
$$= \dim_H((F \cap [\tfrac{2}{3}, 1]) \times [0, \pi])$$
$$= \dim_H(F \cap [\tfrac{2}{3}, 1]) + \dim_H [0, \pi]$$
$$= (\log 2/\log 3) + 1$$

by Corollary 2.4(b) and Example 7.6. This argument requires only minor
modification to show that $F'$ is an $s$-set for $s = 1 + \log 2/\log 3$. □

Figure 7.3 The 'Cantor target': the set swept out by rotating the middle third Cantor
set about an endpoint

The following example demonstrates that we do not in general get equality in 
the product formula (7.6) for Hausdorff measures. 



Example 7.8 

There exist sets $E, F \subset \mathbb{R}$ with $\dim_H E = \dim_H F = 0$ and $\dim_H(E \times F) \ge 1$.

Calculation. Let $0 = m_0 < m_1 < \cdots$ be a rapidly increasing sequence of integers
satisfying a condition to be specified below. Let $E$ consist of those numbers
in $[0, 1]$ with a zero in the $r$th decimal place whenever $m_k + 1 \le r \le m_{k+1}$
and $k$ is even, and let $F$ consist of those numbers with zero in the $r$th decimal
place if $m_k + 1 \le r \le m_{k+1}$ and $k$ is odd. Looking at the first $m_{k+1}$ decimal
places for even $k$, there is an obvious cover of $E$ by $10^{j_k}$ intervals of
length $10^{-m_{k+1}}$, where $j_k = (m_2 - m_1) + (m_4 - m_3) + \cdots + (m_k - m_{k-1})$. Then
$\log 10^{j_k} / -\log 10^{-m_{k+1}} = j_k/m_{k+1}$ which tends to 0 as $k \to \infty$ provided that
the $m_k$ are chosen to increase sufficiently rapidly. By Proposition 4.1
$\dim_H E \le \underline{\dim}_B E = 0$. Similarly, $\dim_H F = 0$.

If $0 < w < 1$ then we can write $w = x + y$ where $x \in E$ and $y \in F$; just take
the $r$th decimal digit of $w$ from $E$ if $m_k + 1 \le r \le m_{k+1}$ and $k$ is odd and from
$F$ if $k$ is even. The mapping $f : \mathbb{R}^2 \to \mathbb{R}$ given by $f(x, y) = x + y$ is easily
seen to be Lipschitz, so

$$\dim_H(E \times F) \ge \dim_H f(E \times F) \ge \dim_H (0, 1) = 1$$

by Corollary 2.4(a). □
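The decomposition $w = x + y$ used in this calculation can be sketched on finitely many decimal digits. The helper names and the short block sequence `m` below are illustrative assumptions only; the example itself needs the $m_k$ to grow far more rapidly, and the code checks just the digit bookkeeping.

```python
from fractions import Fraction

def block_index(r, m):
    """Return k with m[k] + 1 <= r <= m[k+1] for the 1-based decimal place r."""
    for k in range(len(m) - 1):
        if m[k] + 1 <= r <= m[k + 1]:
            return k
    raise ValueError("decimal place lies beyond the last block")

def split_digits(digits, m):
    """Split the decimal digits of w into digits of x (in E) and y (in F).

    Numbers in E have zeros in the even blocks, so x keeps the digits of w
    in the odd blocks; y keeps the digits of w in the even blocks."""
    x, y = [], []
    for r, d in enumerate(digits, start=1):
        if block_index(r, m) % 2 == 1:
            x.append(d)
            y.append(0)
        else:
            x.append(0)
            y.append(d)
    return x, y

def to_fraction(digits):
    """Exact value of 0.d1 d2 d3 ... as a Fraction."""
    return sum(Fraction(d, 10 ** r) for r, d in enumerate(digits, start=1))

m = [0, 1, 3, 7, 15]  # hypothetical block ends, chosen only to keep the demo short
w = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8, 9, 7, 9]
x, y = split_digits(w, m)
assert to_fraction(x) + to_fraction(y) == to_fraction(w)
```

Because the non-zero digits of `x` and `y` occupy disjoint decimal places, the sum involves no carries, which is why every $w$ in $(0, 1)$ is reached.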
A useful generalization of the product formula relates the dimension of a set
to the dimensions of parallel sections. We work in the $(x, y)$-plane and let $L_x$
denote the line parallel to the $y$-axis through the point $(x, 0)$.



Proposition 7.9 

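Let $F$ be a Borel subset of $\mathbb{R}^2$. If $1 \le s \le 2$ then

$$\int_{-\infty}^{\infty} \mathcal{H}^{s-1}(F \cap L_x)\,dx \le \mathcal{H}^s(F). \qquad (7.10)$$

Proof. Given $\varepsilon > 0$, let $\{U_i\}$ be a $\delta$-cover of $F$ such that

$$\sum_i |U_i|^s \le \mathcal{H}^s_\delta(F) + \varepsilon.$$

Each $U_i$ is contained in a square $S_i$ of side $|U_i|$ with sides parallel to the
coordinate axes.

Let $\chi_i$ be the indicator function of $S_i$ (i.e. $\chi_i(x, y) = 1$ if $(x, y) \in S_i$ and
$\chi_i(x, y) = 0$ if $(x, y) \notin S_i$). For each $x$, the sets $\{S_i \cap L_x\}$ form a $\delta$-cover of
$F \cap L_x$, so

$$\mathcal{H}^{s-1}_\delta(F \cap L_x) \le \sum_i |S_i \cap L_x|^{s-1} = \sum_i |U_i|^{s-2} |S_i \cap L_x| = \sum_i |U_i|^{s-2} \int \chi_i(x, y)\,dy.$$

Hence

$$\int \mathcal{H}^{s-1}_\delta(F \cap L_x)\,dx \le \sum_i |U_i|^{s-2} \iint \chi_i(x, y)\,dx\,dy = \sum_i |U_i|^s \le \mathcal{H}^s_\delta(F) + \varepsilon.$$

Since $\varepsilon > 0$ is arbitrary, $\int \mathcal{H}^{s-1}_\delta(F \cap L_x)\,dx \le \mathcal{H}^s_\delta(F)$. Letting $\delta \to 0$ gives
(7.10). □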




Corollary 7.10 

Let $F$ be a Borel subset of $\mathbb{R}^2$. Then, for almost all $x$ (in the sense of 1-dimensional
Lebesgue measure), $\dim_H(F \cap L_x) \le \max\{0, \dim_H F - 1\}$.

Proof. Take $s > \dim_H F$, so that $\mathcal{H}^s(F) = 0$. If $s > 1$, formula (7.10) gives
$\mathcal{H}^{s-1}(F \cap L_x) = 0$ and so $\dim_H(F \cap L_x) \le s - 1$, for almost all $x$. □

We state, without proof, a further useful generalization. 

Proposition 7.11 

Let $F$ be any subset of $\mathbb{R}^2$, and let $E$ be any subset of the $x$-axis. Suppose that
there is a constant $c$ such that $\mathcal{H}^t(F \cap L_x) \ge c$ for all $x \in E$. Then

$$\mathcal{H}^{s+t}(F) \ge bc\,\mathcal{H}^s(E) \qquad (7.11)$$

where $b > 0$ depends only on $s$ and $t$.

This result may be phrased in terms of dimensions. 

Corollary 7.12 

Let $F$ be any subset of $\mathbb{R}^2$, and let $E$ be a subset of the $x$-axis. If $\dim_H(F \cap L_x) \ge t$
for all $x \in E$, then $\dim_H F \ge t + \dim_H E$.

The obvious higher-dimensional analogues of these results are all valid. 
The following illustration of Proposition 7.9 is an example of a self-affine set, 
a class of sets which will be discussed in more detail in Section 9.4. 

Example 7.13. A self-affine set 

Let $F$ be the set with iterated construction indicated in figure 7.4. (At the $k$th stage
each rectangle of $E_k$ is replaced with an affine copy of the rectangles in $E_1$. Thus
the contraction is greater in the '$y$' direction than in the '$x$' direction, with the
width to height ratio of the rectangles in $E_k$ tending to infinity as $k \to \infty$.) Then
$\dim_H F = \dim_B F = 1\tfrac{1}{2}$.

Calculation. $E_k$ consists of $6^k$ rectangles of size $3^{-k} \times 4^{-k}$. Each of these
rectangles may be covered by at most $(4/3)^k + 1$ squares of side $4^{-k}$, by dividing
the rectangles using a series of vertical cuts. Hence $E_k$ may be covered by
$6^k \times 2 \times 4^k \times 3^{-k} = 2 \times 8^k$ squares of diameter $4^{-k}\sqrt{2}$. In the usual way this
gives $\dim_H F \le \overline{\dim}_B F \le 1\tfrac{1}{2}$.

On the other hand, except for $x$ of the form $j 3^{-k}$ where $j$ and $k$ are integers, we
have that $E_k \cap L_x$ consists of $2^k$ intervals of length $4^{-k}$. A standard application
of the mass distribution method shows that $\mathcal{H}^{1/2}(F \cap L_x) \ge \tfrac{1}{2}$ for each such $x$.
By Proposition 7.9, $\mathcal{H}^{3/2}(F) \ge \tfrac{1}{2}$. Hence $\dim_H F = \dim_B F = 1\tfrac{1}{2}$. □
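The upper bound in this calculation is a covering count, and the arithmetic can be sketched as follows. The function names are hypothetical, and the cover is the one described above (vertical cuts of each rectangle into squares of side $4^{-k}$), not an optimal cover.

```python
from fractions import Fraction
from math import ceil, log, sqrt

def cover_count(k):
    """Number of squares of side 4**-k used to cover E_k: each of the 6**k
    rectangles of size 3**-k by 4**-k is cut into ceil((4/3)**k) pieces."""
    per_rectangle = ceil(Fraction(4, 3) ** k)  # exact rational arithmetic
    return 6 ** k * per_rectangle

def dimension_estimate(k):
    """log N / -log(diameter) for the cover by squares of diameter 4**-k * sqrt(2)."""
    diameter = 4.0 ** -k * sqrt(2)
    return log(cover_count(k)) / -log(diameter)

for k in (5, 20, 40):
    # the count never exceeds the bound 2 * 8**k quoted in the calculation
    print(k, cover_count(k) <= 2 * 8 ** k, dimension_estimate(k))
```

The estimates decrease slowly towards $1\tfrac{1}{2} = \log 8/\log 4$ as $k$ grows.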



Figure 7.4 Construction of a self-affine set, $\dim_H F = 1\tfrac{1}{2}$

7.2 Notes and references 

Versions of the basic product formula date from Besicovitch and Moran (1945). A 
very general result, proved using net measures, was given by Marstrand (1954b). 
For packing dimensions of products, see Tricot (1982) and Howroyd (1996). 



Exercises 

7.1 Show that for every set $F \subset [0, 1]$, we have $\dim_H(F \times [0, 1]) = 1 + \dim_H F$.

7.2 Let $F_\lambda$ denote the middle $\lambda$ Cantor set (see Example 4.5). What are the box and
Hausdorff dimensions of $F_\lambda \times F_\mu$ for $0 < \lambda, \mu < 1$?

7.3 Let $F$ be any subset of $[0, \infty)$ and let $F'$ be the 'target' in $\mathbb{R}^2$ given in polar
coordinates by $\{(r, \theta) : r \in F, 0 \le \theta < 2\pi\}$. Show that $\dim_H F' = 1 + \dim_H F$.

7.4 Show that there is a subset F of R 2 of Hausdorff dimension 2 with projections onto 
both coordinate axes of length 0. (Hint: see Exercise 4.9.) Deduce that any 1-set 
contained in F is irregular, and that any rectifiable curve intersects F in a set of 
length 0. 

7.5 Derive Product formula 7.5. 

7.6 What are the Hausdorff and box dimensions of the plane set
$\{(x, y) \in \mathbb{R}^2 : x + y \in F \text{ and } x - y \in F\}$, where $F$ is the middle third Cantor set?

7.7 Let $F \subset \mathbb{R}$ have equal Hausdorff and upper box dimensions. Let $D$ be the
set $\{x - y : x, y \in F\}$, known as the difference set of $F$. Show that
$\dim_H D \le \min\{1, 2\dim_H F\}$. (Hint: consider the set $F \times F$.)

7.8 Sketch the plane set $\{(x, y) : y - x^2 \in F\}$ where $F$ is the middle third Cantor set,
and find its Hausdorff and box dimensions.

7.9 Let $L_x$ be as in Proposition 7.9. Let $F$ be a subset of $\mathbb{R}^2$ and let
$E_s = \{x \in \mathbb{R} : \dim_H(F \cap L_x) \ge s\}$ for $0 \le s \le 1$. Show that
$\dim_H F \ge \sup_{0 \le s \le 1}\{s + \dim_H E_s\}$.

7.10 Divide the unit square $E_0$ into a three column, five row array of rectangles of sides
$\tfrac{1}{3}$ and $\tfrac{1}{5}$, and let $E_1$ be a set obtained by choosing some four of the five
rectangles from each column. Let $F$ be the self-affine set formed by repeatedly replacing
rectangles by affine copies of $E_1$ (compare Example 7.13). Adapt the method of
Example 7.13 to show that $\dim_H F = 1 + \log 4/\log 5$.

7.11 Modify the construction of the previous exercise so that $E_1$ contains four rectangles
from each of the first and third columns but none from the middle column. Show
that $\dim_H F = \log 2/\log 3 + \log 4/\log 5$.



Chapter 8 Intersections of fractals 



The intersection of two fractals is often a fractal; it is natural to try to relate the
dimension of this intersection to that of the original sets. It is immediately
apparent that we can say almost nothing in the general case. For if $F$ is bounded, there
is a congruent copy $F_1$ of $F$ such that $\dim_H(F \cap F_1) = \dim_H F$ (take
$F_1 = F$) and another congruent copy with $\dim_H(F \cap F_1) = 0$ (take $F$ and $F_1$
disjoint). However, we can say rather more provided we consider the intersection
of $F$ and a congruent copy in a 'typical' relative position.

To illustrate this, let $F$ and $F_1$ be unit line segments in the plane. Then $F \cap F_1$
can be a line segment, but only in the exceptional situation when $F$ and $F_1$ are
collinear. If $F$ and $F_1$ cross at an angle, then $F \cap F_1$ is a single point, but now
$F \cap F_1$ remains a single point if $F_1$ is replaced by a nearby congruent copy.
Thus, whilst 'in general' $F \cap F_1$ contains at most one point, this situation occurs
'frequently'.

We can make this rather more precise. Recall that a rigid motion or direct
congruence transformation $\sigma$ of the plane transforms any set $E$ to a congruent
copy $\sigma(E)$ without reflection. The rigid motions may be parametrized by three
coordinates $(x, y, \theta)$ where the origin is transformed to $(x, y)$ and $\theta$ is the angle
of rotation. Such a parametrization provides a natural measure on the space of
rigid motions, with the measure of a set $A$ of rigid motions given by the
3-dimensional Lebesgue measure of the $(x, y, \theta)$ parametrizing the motions in $A$.
For example, the set of all rigid motions which map the origin to a point of the
rectangle $[1, 2] \times [0, 3]$ has measure $1 \times 3 \times 2\pi$.

In the example with $F$ a unit line segment, the set of transformations $\sigma$ for
which $F \cap \sigma(F)$ is a line segment has measure 0. However, $F \cap \sigma(F)$ is a single
point for a set of transformations of positive measure, in fact a set of measure 4.
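The value 4 can be checked numerically under one supplementary observation that is not spelled out in the text: for a fixed rotation angle $\theta$, the translations $(x, y)$ for which the two unit segments meet form a parallelogram of area $|\sin\theta|$, so the measure is the integral of $|\sin\theta|$ over $[0, 2\pi)$. A minimal sketch, with the hypothetical helper name `crossing_measure`:

```python
from math import pi, sin

def crossing_measure(steps=100_000):
    """Midpoint-rule approximation to the integral of |sin(theta)| over
    [0, 2*pi), i.e. the measure of the rigid motions sigma for which a unit
    segment F meets sigma(F), assuming the |sin(theta)| area per angle."""
    h = 2 * pi / steps
    return sum(abs(sin((i + 0.5) * h)) for i in range(steps)) * h

print(crossing_measure())  # close to 4
```

The collinear positions ($\sin\theta = 0$) contribute nothing to the integral, which matches the statement that they form a set of measure 0.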

Similar ideas hold in higher dimensions. In $\mathbb{R}^3$, 'typically', two surfaces
intersect in a curve, a surface and a curve intersect in a point and two curves are
disjoint. In $\mathbb{R}^n$, if smooth manifolds $E$ and $F$ intersect at all, then 'in general'
they intersect in a submanifold of dimension $\max\{0, \dim E + \dim F - n\}$. More
precisely, if $\dim E + \dim F - n > 0$ then $\dim(E \cap \sigma(F)) = \dim E + \dim F - n$
for a set of rigid motions $\sigma$ of positive measure, and is 0 for almost all other
$\sigma$. (Of course, $\sigma$ is now measured using the $\tfrac{1}{2}n(n + 1)$ parameters required to
specify a rigid transformation of $\mathbb{R}^n$.)

8.1 Intersection formulae for fractals



Are there analogues of these formulae if $E$ and $F$ are fractals and we use
Hausdorff dimension? In particular, is it true that 'in general'

$$\dim_H(E \cap \sigma(F)) \le \max\{0, \dim_H E + \dim_H F - n\} \qquad (8.1)$$

and 'often'

$$\dim_H(E \cap \sigma(F)) \ge \dim_H E + \dim_H F - n \qquad (8.2)$$

as $\sigma$ ranges over a group $G$ of transformations, such as the group of translations,
congruences or similarities (see figure 8.1)? Of course 'in general' means 'for
almost all $\sigma$' and 'often' means 'for a set of $\sigma$ of positive measure' with respect to
a natural measure on the transformations in $G$. Generally, $G$ can be parametrized
by $m$ coordinates in a straightforward way for some integer $m$ and we can use
Lebesgue measure on the parameter space $\mathbb{R}^m$.

We may obtain upper bounds for $\dim_H(E \cap \sigma(F))$ when $G$ is the group of
translations; these bounds hold automatically for the larger groups of congruences
and similarities. We have already proved (8.1) in the special case in the plane
where one of the sets is a straight line; this is essentially Corollary 7.10. The
general result is easily deduced from this special case. Recall that
$F + x = \{x + y : y \in F\}$ denotes the translation of $F$ by the vector $x$.






Figure 8.1 The intersection of a 'dust-like' set $E$ with various congruent copies $\sigma(F)$ of
a curve $F$. We are interested in the dimension of $E \cap \sigma(F)$ for 'typical' $\sigma$

Theorem 8.1

If $E$, $F$ are Borel subsets of $\mathbb{R}^n$ then

$$\dim_H(E \cap (F + x)) \le \max\{0, \dim_H(E \times F) - n\} \qquad (8.3)$$

for almost all $x \in \mathbb{R}^n$.

Proof. We prove this when $n = 1$; the proof for $n > 1$ is similar, using a
higher-dimensional analogue of Corollary 7.10. Let $L_c$ be the line in the $(x, y)$-plane with
equation $x = y + c$. Assuming that $\dim_H(E \times F) > 1$, it follows from Corollary
7.10 (rotating the lines through 45° and changing notation slightly) that

$$\dim_H((E \times F) \cap L_c) \le \dim_H(E \times F) - 1 \qquad (8.4)$$

for almost all $c \in \mathbb{R}$. But a point $(x, x - c) \in (E \times F) \cap L_c$ if and only if
$x \in E \cap (F + c)$. Thus, for each $c$, the projection onto the $x$-axis of $(E \times F) \cap L_c$
is the set $E \cap (F + c)$. In particular, $\dim_H(E \cap (F + c)) = \dim_H((E \times F) \cap L_c)$,
so the result follows from (8.4). □

Theorem 8.1 is some way from (8.1), but examples show that it is the best
that we can hope to achieve, even if the group of translations is replaced by the
group of all rigid motions. Unfortunately, inequality (7.6) is the opposite to what
would be needed to deduce (8.1) from (8.3). Nevertheless, in many instances,
we do have $\dim_H(E \times F) = \dim_H E + \dim_H F$; for example, if $\dim_H F = \overline{\dim}_B F$;
see Corollary 7.4. Under such circumstances we recover (8.1), with $\sigma(F)$ as the
translate $F + x$.

Lower bounds for $\dim_H(E \cap \sigma(F))$ of the form (8.2) are rather harder to
obtain. The main known results are contained in the following theorem.

Theorem 8.2 

Let $E, F \subset \mathbb{R}^n$ be Borel sets, and let $G$ be a group of transformations on $\mathbb{R}^n$.
Then

$$\dim_H(E \cap \sigma(F)) \ge \dim_H E + \dim_H F - n \qquad (8.5)$$

for a set of motions $\sigma \in G$ of positive measure in the following cases:

(a) $G$ is the group of similarities and $E$ and $F$ are arbitrary sets.

(b) $G$ is the group of rigid motions, $E$ is arbitrary and $F$ is a rectifiable curve,
surface, or manifold.

(c) $G$ is the group of rigid motions and $E$ and $F$ are arbitrary, with either
$\dim_H E > \tfrac{1}{2}(n + 1)$ or $\dim_H F > \tfrac{1}{2}(n + 1)$.

*Outline of proof. The proof uses the potential theoretic methods of Section 4.3. In
many ways, the argument resembles that of Projection theorem 6.1, but various
technical difficulties make it much more complicated.

Briefly, if $s < \dim_H E$ and $t < \dim_H F$, there are mass distributions $\mu$ on $E$ and
$\nu$ on $F$ with the energies $I_s(\mu)$ and $I_t(\nu)$ both finite. If $\nu$ happened to be absolutely
continuous with respect to $n$-dimensional Lebesgue measure, i.e. if there were a
function $f$ such that $\nu(A) = \int_A f(x)\,dx$ for each set $A$, then it would be natural
to define a mass distribution $\eta_\sigma$ on $E \cap \sigma(F)$ by $\eta_\sigma(A) = \int_A f(\sigma^{-1}(x))\,d\mu(x)$.
If we could show that $I_{s+t-n}(\eta_\sigma) < \infty$ for almost all $\sigma$, Theorem 4.13(a) would
imply that $\dim_H(E \cap \sigma(F)) \ge s + t - n$ if $\eta_\sigma(\mathbb{R}^n) > 0$. Unfortunately, when $F$
is a fractal, $\nu$ is supported by a set of zero $n$-dimensional volume, so is anything
but absolutely continuous. To get around this difficulty, we can approximate $\nu$ by
absolutely continuous mass distributions $\nu_\delta$ supported by the $\delta$-neighbourhood of
$F$. Then, if $\nu_\delta(A) = \int_A f_\delta(x)\,dx$ and $\eta_{\sigma,\delta}(A) = \int_A f_\delta(\sigma^{-1}(x))\,d\mu(x)$, we can estimate
$I_{s+t-n}(\eta_{\sigma,\delta})$ and take the limit as $\delta \to 0$. Simplifying the integral
$\int I_{s+t-n}(\eta_{\sigma,\delta})\,d\sigma$ isolates a term

$$\varphi_\delta(w) = \int_{G_0} \int_{\mathbb{R}^n} \nu_\delta(y)\,\nu_\delta(y + \sigma(w))\,dy\,d\sigma$$

where integration with respect to $\sigma$ is now over the subgroup $G_0$ of $G$ which
fixes the origin. Provided that

$$\varphi_\delta(w) \le \text{constant}\,|w|^{t-n} \qquad (8.6)$$

for all $w$ and $\delta$, it may be shown that $\int I_{s+t-n}(\eta_{\sigma,\delta})\,d\sigma < c < \infty$, where $c$ is
independent of $\delta$. Letting $\delta \to 0$, the measures $\eta_{\sigma,\delta}$ 'converge' to measures $\eta_\sigma$ on
$E \cap \sigma(F)$, where $\int I_{s+t-n}(\eta_\sigma)\,d\sigma < c$. Thus $I_{s+t-n}(\eta_\sigma) < \infty$ for almost all $\sigma$, so, by
Theorem 4.13(a), $\dim_H(E \cap \sigma(F)) \ge s + t - n$ whenever $\eta_\sigma(E \cap \sigma(F)) > 0$,
which happens on a set of positive measure.

It may be shown that (8.6) holds if $I_t(\nu) < \infty$ in the cases (a), (b) and (c)
listed. This is relatively easy to show for (a) and (b). Case (c) is more awkward,
requiring Fourier transform theory. □

The condition that $\dim_H E$ or $\dim_H F > \tfrac{1}{2}(n + 1)$ in case (c) is a curious
consequence of the use of Fourier transforms. It is not known whether the theorem
remains valid for the group of congruences if $n \ge 2$ and
$\tfrac{1}{2}n < \dim_H E, \dim_H F \le \tfrac{1}{2}(n + 1)$.



Example 8.3 

Let $F \subset \mathbb{R}$ be the middle third Cantor set. For $\lambda, x \in \mathbb{R}$ write
$\lambda F + x = \{\lambda y + x : y \in F\}$. Then $\dim_H(F \cap (F + x)) \le 2(\log 2/\log 3) - 1$ for almost all
$x \in \mathbb{R}$, and $\dim_H(F \cap (\lambda F + x)) = 2(\log 2/\log 3) - 1$ for a set of $(x, \lambda) \in \mathbb{R}^2$ of
positive plane Lebesgue measure.

Calculation. We showed in Example 7.6 that $\dim_H(F \times F) = 2(\log 2/\log 3)$, so
the stated dimensions follow from Theorems 8.1 and 8.2(a). □
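For orientation, the numerical value of the bound in this example can be evaluated directly. The helper name `typical_intersection_dim` is hypothetical; it simply evaluates the right-hand side of (8.1).

```python
from math import log

def typical_intersection_dim(dim_e, dim_f, n):
    """Right-hand side of (8.1): max{0, dim E + dim F - n}."""
    return max(0.0, dim_e + dim_f - n)

cantor_dim = log(2) / log(3)  # Hausdorff dimension of the middle third Cantor set
value = typical_intersection_dim(cantor_dim, cantor_dim, 1)
print(value)  # 2 log 2 / log 3 - 1, roughly 0.26
```

The value is positive because $\dim_H F > \tfrac{1}{2}$; for a Cantor-type set of dimension at most $\tfrac{1}{2}$ the same formula would return 0.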




The box and packing dimensions of intersections of sets can behave very 
irregularly, with the dimensions of intersections often being 'unexpectedly small', 
see Exercise 8.7. 



*8.2 Sets with large intersection 

We have seen that (8.1) need not always hold; in this section we examine a class
of sets for which it fails dramatically. We construct a large class $\mathcal{C}^s$ of subsets
of $\mathbb{R}$ of Hausdorff dimension at least $s$ with the property that the intersection of
any countable collection of sets in $\mathcal{C}^s$ still has dimension at least $s$. Sets of this
type occur naturally in number theory; see Section 10.3.

The class $\mathcal{C}^s$ is defined in terms of the sums (2.1) used in the definition of
Hausdorff measures. For any subset $F$ of $\mathbb{R}$ we define

$$\mathcal{H}^s_\infty(F) = \inf\Big\{\sum_{i=1}^{\infty} |U_i|^s : \bigcup_{i=1}^{\infty} U_i \text{ is any cover of } F\Big\}.$$

Thus $\mathcal{H}^s_\infty(F)$ is defined using covers of $F$ without any diameter restriction. This
ensures that $\mathcal{H}^s_\infty(I)$ is finite if $I$ is a bounded interval, which would not be the
case if we used $\mathcal{H}^s$. It is easy to see that $\mathcal{H}^s_\infty(F_1 \cup F_2) \le \mathcal{H}^s_\infty(F_1) + \mathcal{H}^s_\infty(F_2)$
and that $\mathcal{H}^s_\infty(F_1) \le \mathcal{H}^s_\infty(F_2)$ if $F_1 \subset F_2$.

Recall that lim^^^E^ — Hi=i UT=i ^k is the set of points that belong to 
infinitely many E k ; such sets are termed Urn sup sets. Let < s < 1 and let 
[a, b] C K be a proper closed interval. We say that a subset F of [a,b] is a 
member of the class C s [a, b] if 



F D lim E k (8.7) 

k— >oo 

where {E k } is a sequence of subsets of [a, b], such that 

(i) each E k is a finite union of disjoint closed intervals, and 

(ii) iim k ^ 00 n s 00 (inE k ) = \ir (8.8) 

for every bounded closed interval /. 

(Of course, we always have $\mathcal{H}^s_\infty(I \cap E_k) \le |I|^s$.) We define $\mathcal{C}^s(-\infty, \infty)$ by saying
that $F$ is in $\mathcal{C}^s(-\infty, \infty)$ if $F \cap [a, b] \in \mathcal{C}^s[a, b]$ for every bounded interval $[a, b]$.
The results below extend easily from $\mathcal{C}^s[a, b]$ to $\mathcal{C}^s(-\infty, \infty)$.

As an example of the sets we have in mind, we might take
$E_k = \{x : |x - p/k| < k^{-3} \text{ for some integer } p\}$, so that $F = \overline{\lim}_{k\to\infty} E_k$ consists of the
numbers which satisfy the inequality $|x - p/k| < k^{-3}$ for some $p$ for infinitely
many positive integers $k$. As we shall see, $F \in \mathcal{C}^{1/3}(-\infty, \infty)$.
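The sets $E_k$ of this example are easy to test for membership, since $|x - p/k| < k^{-3}$ for some integer $p$ exactly when $kx$ lies within $k^{-2}$ of an integer. The sketch below uses exact rational arithmetic; the function names `in_E_k` and `hits` are hypothetical, and a finite search can of course only list the $k$ up to a bound, not decide membership of the lim sup set.

```python
from fractions import Fraction


def in_E_k(x: Fraction, k: int, alpha: int = 3) -> bool:
    """True if |x - p/k| < k**(-alpha) for some integer p."""
    scaled = x * k                       # |x - p/k| < k^-alpha  <=>  |k x - p| < k^(1-alpha)
    nearest = round(scaled)              # nearest integer p (ties give the same distance)
    return abs(scaled - nearest) < Fraction(1, k ** (alpha - 1))


def hits(x: Fraction, k_max: int, alpha: int = 3) -> list[int]:
    """All k <= k_max for which x belongs to E_k."""
    return [k for k in range(1, k_max + 1) if in_E_k(x, k, alpha)]


if __name__ == "__main__":
    print(hits(Fraction(1, 2), 10))      # a rational p/q lies in E_k for every multiple k of q
```

Rational numbers always belong to infinitely many $E_k$; the interesting members of $F$ are the irrationals that are unusually well approximated by rationals.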

Any set in $\mathcal{C}^s[a, b]$ must be dense in $[a, b]$; for if $F$ is in $\mathcal{C}^s[a, b]$ and $I$ is a
closed interval, then $I \cap E_{k_1}$ contains a closed interval $I_1$ if $k_1$ is large enough,
by (8.8). Similarly, $I_1 \cap E_{k_2}$ contains a closed interval $I_2$ for some $k_2 > k_1$.
Proceeding in this way, we get a sequence of closed intervals $I \supset I_1 \supset I_2 \supset \cdots$
with $I_r \subset E_{k_r}$ for each $r$. Thus the non-empty set $\bigcap_{r=1}^{\infty} I_r$ is contained in infinitely
many $E_k$, so is contained in $F \cap I$.

By Proposition 3.4 any set in $\mathcal{C}^s[a, b]$ has box-counting dimension 1. We
now show that these sets have Hausdorff dimension at least $s$. Moreover the
intersection of any countable collection of sets in $\mathcal{C}^s[a, b]$ is also in $\mathcal{C}^s[a, b]$
and so has dimension at least $s$. Furthermore $f(F)$ is in $\mathcal{C}^s[f(a), f(b)]$ if $F$
is in $\mathcal{C}^s[a, b]$, for a large class of functions $f$. The proofs below are somewhat
technical and might well be omitted on a first reading. We require the following
lemma which extends (8.8) to unions of closed intervals and which is the key to
these properties.

Lemma 8.4

Let $\{E_k\}$ be a sequence of closed subsets of $\mathbb{R}$ such that

\[ \lim_{k\to\infty} \mathcal{H}^s_\infty(I \cap E_k) = |I|^s \tag{8.9} \]

for every bounded closed interval $I$. Then, if $A$ is a bounded set made up of a finite
union of closed intervals,

\[ \lim_{k\to\infty} \mathcal{H}^s_\infty(A \cap E_k) = \mathcal{H}^s_\infty(A). \tag{8.10} \]

Proof. Suppose that $A$ consists of $m$ disjoint intervals with minimum separation
$d > 0$. Given $\varepsilon > 0$ we may, using (8.9), choose $k_\varepsilon$ such that if $k \ge k_\varepsilon$

\[ \mathcal{H}^s_\infty(I \cap E_k) > (1 - \varepsilon)|I|^s \tag{8.11} \]

whenever $|I| \ge \varepsilon d$ and $I \subset A$. (Since $\mathcal{H}^s_\infty(E_k \cap I)$ varies continuously with $I$
in the obvious sense, we may find a $k_\varepsilon$ such that (8.11) holds simultaneously
for all such $I$.) To estimate $\mathcal{H}^s_\infty(A \cap E_k)$ let $\{U_i\}$ be a cover of $A \cap E_k$. We
may assume that this cover is finite, since $A \cap E_k$ is compact (see Section 2.4)
and also that the $U_i$ are closed intervals with endpoints in $A$, which are disjoint
except possibly at endpoints. We divide the sets $U_i$ into two batches according to
whether $|U_i| \ge d$ or $|U_i| < d$. The set $A \setminus \bigcup_{|U_i| \ge d} U_i$ consists of disjoint intervals
$V_1, \dots, V_r$ where $r \le m$, and

\[ A \subset \bigcup_{|U_i| \ge d} U_i \cup \bigcup_j V_j. \tag{8.12} \]

Observe that any $U_i$ with $|U_i| < d$ is contained in an interval of $A$, and so in
one of the $V_j$. For each $j$ the sets $U_i$ contained in $V_j$ cover $V_j \cap E_k$, so

\[ \sum_{\{i : U_i \subset V_j\}} |U_i|^s \ge \mathcal{H}^s_\infty(V_j \cap E_k) > (1 - \varepsilon)|V_j|^s \]

if $|V_j| \ge \varepsilon d$, by (8.11). Hence

\[ \sum_i |U_i|^s \ge \sum_{|U_i| \ge d} |U_i|^s + \sum_j \sum_{U_i \subset V_j} |U_i|^s \ge \sum_{|U_i| \ge d} |U_i|^s + \sum_{|V_j| \ge \varepsilon d} (1 - \varepsilon)|V_j|^s. \tag{8.13} \]

From (8.12)

\[ \mathcal{H}^s_\infty(A) \le \sum_{|U_i| \ge d} |U_i|^s + \sum_{|V_j| \ge \varepsilon d} |V_j|^s + \sum_{|V_j| < \varepsilon d} |V_j|^s \le \sum_{|U_i| \ge d} |U_i|^s + \sum_{|V_j| \ge \varepsilon d} |V_j|^s + (\varepsilon d)^s m. \]

Combining with (8.13) we see that

\[ \mathcal{H}^s_\infty(A) \le (1 - \varepsilon)^{-1} \sum_i |U_i|^s + (\varepsilon d)^s m \]

for any cover $\{U_i\}$ of $A \cap E_k$. Thus

\[ \mathcal{H}^s_\infty(A) \le (1 - \varepsilon)^{-1} \mathcal{H}^s_\infty(A \cap E_k) + (\varepsilon d)^s m \]

if $k \ge k_\varepsilon$, which implies (8.10). □

Repeated application of this lemma now gives the dimension estimate we
require.

Proposition 8.5

If $F \in \mathcal{C}^s[a, b]$ then $\mathcal{H}^s(F) > 0$, and in particular $\dim_H F \ge s$.

Proof. For simplicity of notation assume that $[a, b] = [0, 1]$. Suppose $\overline{\lim}_{k\to\infty} E_k
\subset F \subset \bigcup_i U_i$ where the $U_i$ are open sets. Taking $I = [0, 1]$ in (8.8) we may
find a number $k_1$ such that $\mathcal{H}^s_\infty(E_{k_1}) > \frac{1}{2}$. Since $E_{k_1}$ is a finite union of closed
intervals, Lemma 8.4 implies there is a number $k_2 > k_1$ such that $\mathcal{H}^s_\infty(E_{k_1} \cap E_{k_2})
> \frac{1}{2}$. Proceeding in this way, we get a sequence $k_1 < k_2 < \cdots$ such that
$\mathcal{H}^s_\infty(E_{k_1} \cap \cdots \cap E_{k_r}) > \frac{1}{2}$ for all $r$. We have $\bigcap_{i=1}^{\infty} E_{k_i} \subset F \subset \bigcup_i U_i$; since
$E_{k_1} \cap \cdots \cap E_{k_r}$ is a decreasing sequence of compact (i.e. closed and bounded) sets and
$\bigcup_i U_i$ is open, there is an integer $r$ such that $E_{k_1} \cap \cdots \cap E_{k_r} \subset \bigcup_i U_i$. It follows
that $\sum_i |U_i|^s \ge \mathcal{H}^s_\infty(E_{k_1} \cap \cdots \cap E_{k_r}) > \frac{1}{2}$ for every cover of $F$ by open sets, so
$\mathcal{H}^s(F) \ge \frac{1}{2}$. □

Proposition 8.6

Let $F_j \in \mathcal{C}^s[a, b]$ for $j = 1, 2, \dots$. Then $\bigcap_{j=1}^{\infty} F_j \in \mathcal{C}^s[a, b]$.

Proof. For each $j$ there is a sequence of sets $E_{j,k}$, each a finite union of closed
intervals, such that $F_j \supset \overline{\lim}_{k\to\infty} E_{j,k}$, where $\lim_{k\to\infty} \mathcal{H}^s_\infty(I \cap E_{j,k}) = \mathcal{H}^s_\infty(I)$
for every interval $I$. By Lemma 8.4

\[ \lim_{k\to\infty} \mathcal{H}^s_\infty(A \cap E_{j,k}) = \mathcal{H}^s_\infty(A) \tag{8.14} \]

for any finite union of closed intervals $A$. There are countably many intervals
$[c, d] \subset [a, b]$ with $c$ and $d$ both rational: let $I_1, I_2, \dots$ be an enumeration of all
such intervals.

For each $r$ we define a set $G_r$ as follows. Using (8.14) we may choose $k_1 \ge r$
large enough to make

\[ \mathcal{H}^s_\infty(I_m \cap E_{1,k_1}) > \mathcal{H}^s_\infty(I_m) - 1/r \]

simultaneously for $m = 1, \dots, r$. Using (8.14) again, taking $A = I_m \cap E_{1,k_1}$, we
may find $k_2 \ge r$ such that

\[ \mathcal{H}^s_\infty(I_m \cap E_{1,k_1} \cap E_{2,k_2}) > \mathcal{H}^s_\infty(I_m) - 1/r \]

for $m = 1, \dots, r$. Continuing in this way, we get $k_1, \dots, k_r \ge r$ such that

\[ \mathcal{H}^s_\infty\Big(I_m \cap \bigcap_{j=1}^{r} E_{j,k_j}\Big) > \mathcal{H}^s_\infty(I_m) - 1/r \tag{8.15} \]

for all $m = 1, \dots, r$. For each $r$, let $G_r$ be the finite union of closed intervals

\[ G_r = \bigcap_{j=1}^{r} E_{j,k_j}. \tag{8.16} \]

Let $I \subset [a, b]$ be any closed interval. Given $\varepsilon > 0$, there is an interval $I_m \subset I$
such that $\mathcal{H}^s_\infty(I_m) > \mathcal{H}^s_\infty(I) - \varepsilon/2$. If $r \ge m$ and $r > 2/\varepsilon$, (8.15) gives that
$\mathcal{H}^s_\infty(I \cap G_r) \ge \mathcal{H}^s_\infty(I_m \cap G_r) > \mathcal{H}^s_\infty(I_m) - 1/r > \mathcal{H}^s_\infty(I) - \varepsilon$, so

\[ \lim_{r\to\infty} \mathcal{H}^s_\infty(I \cap G_r) = \mathcal{H}^s_\infty(I). \]

Let $j$ be any positive integer. If $r > j$ and $x \in G_r$, then $x \in E_{j,k_j}$, by (8.16). Thus
if $x \in \overline{\lim}_{r\to\infty} G_r$, then $x \in E_{j,k_j}$ for infinitely many $k_j$, so $x \in \overline{\lim}_{i\to\infty} E_{j,i} \subset F_j$.
Hence $\overline{\lim}_{r\to\infty} G_r \subset F_j$ for each $j$, so $\bigcap_{j=1}^{\infty} F_j \in \mathcal{C}^s[a, b]$. □

Corollary 8.7

Let $F_j \in \mathcal{C}^s[a, b]$ for $j = 1, 2, \dots$. Then $\dim_H \bigcap_{j=1}^{\infty} F_j \ge s$.

Proof. This is immediate from Propositions 8.5 and 8.6. □

Clearly, if $F$ is in $\mathcal{C}^s(-\infty, \infty)$ then so is the translate $F + x$. Hence, given a set
$F$ in $\mathcal{C}^s(-\infty, \infty)$ and a sequence of numbers $x_1, x_2, \dots$, we have $\bigcap_{i=1}^{\infty} (F + x_i)$
a member of $\mathcal{C}^s(-\infty, \infty)$, so that this intersection has dimension at least $s$. The
same idea may be applied using more general transformations of $F$.



Proposition 8.8

Let $f : [a, b] \to \mathbb{R}$ be a mapping with a continuous derivative such that $|f'(x)| > c$
for some constant $c > 0$. If $F \in \mathcal{C}^s[a, b]$, then $f(F) \in \mathcal{C}^s[f(a), f(b)]$.

Proof. This may be proved in the same sort of way as Proposition 8.4. We omit 
the (rather tedious) details. □ 

In a typical $\mathcal{C}^s$ set the $E_k$ are made up of intervals which have lengths and
spacings tending to 0 as $k \to \infty$.



Example 8.9

Fix $\alpha > 2$. Let $E_k = \{x : |x - p/k| \le k^{-\alpha} \text{ for some integer } p\}$, so that $E_k$ is
a union of equally spaced intervals of length $2k^{-\alpha}$. Then $F = \overline{\lim}_{k\to\infty} E_k$ is a
member of $\mathcal{C}^s(-\infty, \infty)$ for all $s < 1/\alpha$.

Proof. Take $0 < s < 1/\alpha$ and a bounded closed interval $I$. We must show that

\[ \lim_{k\to\infty} \mathcal{H}^s_\infty(I \cap E_k) = |I|^s. \tag{8.17} \]

The interval $I$ contains $m$ complete intervals of $E_k$, each of length $2k^{-\alpha}$, where
$m \ge k|I| - 2$. Let $\mu$ be the mass distribution on $I \cap E_k$ obtained by distributing
a mass $1/m$ uniformly across each complete interval of $E_k$ contained in $I$. To
estimate $\mathcal{H}^s_\infty(I \cap E_k)$, let $U$ be a set in a covering of $I \cap E_k$; we may assume
that $U$ is a closed interval and that the ends of $U$ are points of $I \cap E_k$. Then $U$
intersects at most $k|U| + 2$ intervals of $I \cap E_k$. If $1/2k \le |U| \le |I|$ then

\[
\begin{aligned}
\mu(U) &\le (k|U| + 2)/m \le (k|U| + 2)/(k|I| - 2) = (|U| + 2k^{-1})/(|I| - 2k^{-1}) \\
&= |U|^s (|U|^{1-s} + 2k^{-1}|U|^{-s})/(|I| - 2k^{-1}) \\
&\le |U|^s (|U|^{1-s} + 2^{s+1}k^{s-1})/(|I| - 2k^{-1}) \\
&= \frac{|U|^s}{|I|^s} \cdot \frac{|U|^{1-s}|I|^{s-1} + 2^{s+1}k^{s-1}|I|^{s-1}}{1 - 2k^{-1}|I|^{-1}} \\
&\le \frac{|U|^s}{|I|^s} \cdot \frac{1 + 2^{s+1}k^{s-1}|I|^{s-1}}{1 - 2k^{-1}|I|^{-1}}.
\end{aligned}
\tag{8.18}
\]

On the other hand, if $k$ is large enough and $|U| < 1/2k$, then $U$ can intersect just
one interval of $E_k$ so $|U| \le 2k^{-\alpha}$, since the endpoints of $U$ are in $E_k$. A mass
of $1/m$ is distributed evenly across this interval of length $2k^{-\alpha}$, so

\[
\begin{aligned}
\mu(U) &\le |U|/(2k^{-\alpha} m) = |U|^s |U|^{1-s}/(2k^{-\alpha} m) \le |U|^s (2k^{-\alpha})^{1-s}/(2k^{-\alpha}(k|I| - 2)) \\
&\le |U|^s 2^{-s} k^{s\alpha - 1}/(|I| - 2k^{-1}).
\end{aligned}
\tag{8.19}
\]

With $I$ and $\varepsilon > 0$ given, then, provided $k$ is sufficiently large,

\[ \mu(U) \le (1 + \varepsilon)|U|^s/|I|^s \]

for all covering intervals $U$, using (8.18) and (8.19). Hence if $I \cap E_k \subset \bigcup_i U_i$
then

\[ 1 = \mu(I \cap E_k) \le \sum_i \mu(U_i) \le (1 + \varepsilon)|I|^{-s} \sum_i |U_i|^s \]

so $\mathcal{H}^s_\infty(I \cap E_k) \ge |I|^s/(1 + \varepsilon)$, from which (8.17) follows. □
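The final step of the proof relies on the multipliers of $|U|^s/|I|^s$ in (8.18) and (8.19) tending to 1 and to 0 respectively as $k \to \infty$, the latter because $s\alpha < 1$. The sketch below evaluates both multipliers numerically, reading (8.18) as $(1 + 2^{s+1}k^{s-1}|I|^{s-1})/(1 - 2k^{-1}|I|^{-1})$ and (8.19) as $2^{-s}k^{s\alpha-1}|I|^{s-1}/(1 - 2k^{-1}|I|^{-1})$. The function names and the sample values of $s$, $\alpha$ and $|I|$ are assumptions chosen for illustration.

```python
def factor_818(k: int, s: float, length: float) -> float:
    """Multiplier of |U|^s/|I|^s in (8.18); requires k * length > 2."""
    return (1 + 2 ** (s + 1) * k ** (s - 1) * length ** (s - 1)) / (1 - 2 / (k * length))


def factor_819(k: int, s: float, alpha: float, length: float) -> float:
    """(8.19) rewritten as a multiple of |U|^s/|I|^s; requires k * length > 2."""
    return 2 ** (-s) * k ** (s * alpha - 1) * length ** (s - 1) / (1 - 2 / (k * length))


if __name__ == "__main__":
    s, alpha, length = 0.25, 3.0, 1.0    # sample values with s < 1/alpha
    for k in (10, 10**3, 10**6, 10**9):
        print(k, factor_818(k, s, length), factor_819(k, s, alpha, length))
```

The convergence of the second multiplier is slow when $s$ is close to $1/\alpha$, which is in keeping with the strict inequality $s < 1/\alpha$ in the statement of the example.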

In this example, $F$ belongs to $\mathcal{C}^s(-\infty, \infty)$ if $s < 1/\alpha$, so $\dim_H F \ge 1/\alpha$, by
Proposition 8.5. Moreover, it is clear that the translate $F + x$ is in $\mathcal{C}^s(-\infty, \infty)$ for
any real number $x$, so by Proposition 8.6 $\bigcap_{i=1}^{\infty} (F + x_i)$ belongs to $\mathcal{C}^s(-\infty, \infty)$
for any countable set $x_1, x_2, \dots$, implying that $\dim_H \bigcap_{i=1}^{\infty} (F + x_i) \ge 1/\alpha$. More
generally, $f(F)$ is in $\mathcal{C}^s(-\infty, \infty)$ for all 'reasonable' functions $f$ by Proposition
8.8, and this generates a large stock of $\mathcal{C}^s$ sets, countable intersections of which
also have dimension at least $1/\alpha$.

In Section 10.3 we shall indicate how Example 8.9 may be improved to give $F$
in $\mathcal{C}^s(-\infty, \infty)$ for all $s < 2/\alpha$, with corresponding consequences for dimensions.

8.3 Notes and references 

The study of intersections of sets as they are moved relative to one another is part 
of a subject known as integral geometry. A full account in the classical setting is 
given by Santaló (1976). The main references for the fractal intersection formula
of Section 8.1 are Kahane (1986) and Mattila (1984, 1985, 1995).

For the strange properties of packing dimension of intersections of sets, see
Falconer, Järvenpää and Mattila (1999) and Csörnyei (2001).

There are several definitions of classes of sets with large intersections, such 
as those given by Baker and Schmidt (1970) and Falconer (1985b). An impor- 
tant class of 'ubiquitous' systems of sets was introduced by Dodson, Rynne 
and Vickers (1990), see also Bernik and Dodson (1999), and their 'Ubiquity 
theorem' provides a powerful technique for finding dimensions of lim sup sets. 
Rynne (1992) compares different definitions of large intersection classes.
Falconer (1994) gives a general theory of large intersection sets and also considers
their packing dimensions.

Exercises

8.1 Let $E$ and $F$ be rectifiable curves in $\mathbb{R}^2$ and let $\sigma$ be a rigid motion. Prove Poincaré's
formula of classical integral geometry

\[ 4 \times \operatorname{length}(E) \times \operatorname{length}(F) = \int (\text{number of points in } (E \cap \sigma(F)))\, d\sigma \]

where integration is with respect to the natural measure on the set of rigid motions.
(Hint: show this first when $E$ and $F$ are line segments, then for polygons, and obtain
the general result by approximation.)

8.2 Show that if a curve $C$ bounds a (compact) convex set in the plane, then the length
of $C$ is given by

\[ \frac{1}{2} \int_{\theta=0}^{2\pi} \operatorname{length}(\operatorname{proj}_\theta C)\, d\theta. \]

(Hint: take $E$ as $C$ and $F$ as a long line segment in the result of Exercise 8.1.)

8.3 In the plane, let $E$ be the product of two middle third Cantor sets and let $F$ be (i) a
circle, (ii) the von Koch curve and (iii) a congruent copy of $E$. In each case, what can
be said about the Hausdorff dimension of $E \cap \sigma(F)$ for congruence transformations
$\sigma$?

8.4 Show that the conclusion of Theorem 8.1 may be extended to give that $E \cap (F + x)$
is empty for almost all $x$ if $\dim_H(E \times F) < n$.

8.5 By taking $E$ as a suitable set dense in a region of $\mathbb{R}^2$ and $F$ as a unit line segment,
show that (8.5) fails if Hausdorff dimension is replaced by box dimensions, even for
the group of similarities.

8.6 Let $1 < s < 2$. Construct a plane $s$-set $F$ in the unit disc $B$ such that if $E$ is any
straight line segment of length 2 that intersects the interior of $B$ then $E \cap F$ is an
$(s - 1)$-set.

8.7 Let $E$ be the set of parallel line segments

\[ E = \{(0, 0)\} \cup \bigcup_{n=1}^{\infty} \{(x, n^{-1/2}) : 0 \le x \le n^{-1/2}\}. \]

Show that $\underline{\dim}_B E \ge 4/3$, but that $\dim_B(L \cap E) = 0$ for all lines $L$ that do not pass
through the origin. (Hint: see Example 3.5.)

8.8 Let $E_k$ be the set of real numbers with base-3 expansion $m \cdot a_1 a_2 \cdots$ such that $a_k = 0$
or 2. Show that $F = \overline{\lim}_{k\to\infty} E_k$ is in class $\mathcal{C}^s(-\infty, \infty)$ for all $0 < s < 1$. (Note that $F$
is the set of numbers with infinitely many base-3 digits different from 1.) Deduce that
$\dim_H F = 1$ and that $\dim_H(\bigcap_{i=1}^{\infty} (F + x_i)) = 1$ for any countable sequence $x_1, x_2, \dots$.

Chapter 9 Iterated function

systems — self-similar 
and self-affine sets 



9.1 Iterated function systems 

Many fractals are made up of parts that are, in some way, similar to the whole. 
For example, the middle third Cantor set is the union of two similar copies of 
itself, and the von Koch curve is made up of four similar copies. These self- 
similarities are not only properties of the fractals: they may actually be used to 
define them. Iterated function systems do this in a unified way and, moreover, 
often lead to a simple way of finding dimensions. 

Let $D$ be a closed subset of $\mathbb{R}^n$, often $D = \mathbb{R}^n$. A mapping $S : D \to D$ is
called a contraction on $D$ if there is a number $c$ with $0 < c < 1$ such that
$|S(x) - S(y)| \le c|x - y|$ for all $x, y \in D$. Clearly any contraction is continuous.
If equality holds, i.e. if $|S(x) - S(y)| = c|x - y|$, then $S$ transforms sets into
geometrically similar sets, and we call $S$ a contracting similarity.

A finite family of contractions $\{S_1, S_2, \dots, S_m\}$, with $m \ge 2$, is called an
iterated function system or IFS. We call a non-empty compact subset $F$ of $D$ an
attractor (or invariant set) for the IFS if

\[ F = \bigcup_{i=1}^{m} S_i(F). \]



The fundamental property of an iterated function system is that it determines a
unique attractor, which is usually a fractal. For a simple example, take $F$ to be
the middle third Cantor set. Let $S_1, S_2 : \mathbb{R} \to \mathbb{R}$ be given by

\[ S_1(x) = \tfrac{1}{3}x; \qquad S_2(x) = \tfrac{1}{3}x + \tfrac{2}{3}. \]

Then $S_1(F)$ and $S_2(F)$ are just the left and right 'halves' of $F$, so that
$F = S_1(F) \cup S_2(F)$; thus $F$ is an attractor of the IFS consisting of the contractions
$\{S_1, S_2\}$, the two mappings which represent the basic self-similarities of the
Cantor set.

Figure 9.1 The Hausdorff distance between the sets $A$ and $B$ is the least $\delta > 0$ such that
the $\delta$-neighbourhood $A_\delta$ of $A$ contains $B$ and the $\delta$-neighbourhood $B_\delta$ of $B$ contains $A$

We shall prove the fundamental property that an IFS has a unique (non-empty
compact, i.e. closed and bounded) attractor. This means, for example, that the
middle third Cantor set is completely specified as the attractor of the mappings
$\{S_1, S_2\}$ given above.

To this end, we define a metric or distance $d$ between subsets of $D$. Let $\mathcal{S}$ denote
the class of all non-empty compact subsets of $D$. Recall that the $\delta$-neighbourhood
of a set $A$ is the set of points within distance $\delta$ of $A$, i.e.
$A_\delta = \{x \in D : |x - a| \le \delta \text{ for some } a \in A\}$. We make $\mathcal{S}$ into a metric space by
defining the distance between two sets $A$ and $B$ to be the least $\delta$ such that the
$\delta$-neighbourhood of $A$ contains $B$ and vice versa:

\[ d(A, B) = \inf\{\delta : A \subset B_\delta \text{ and } B \subset A_\delta\} \]

(see figure 9.1). A simple check shows that $d$ is a metric or distance function, that
is, satisfies the three requirements (i) $d(A, B) \ge 0$ with equality if and only if $A = B$,
(ii) $d(A, B) = d(B, A)$, (iii) $d(A, B) \le d(A, C) + d(C, B)$ for all $A, B, C \in \mathcal{S}$.
The metric $d$ is known as the Hausdorff metric on $\mathcal{S}$. In particular, if $d(A, B)$
is small, then $A$ and $B$ are close to each other as sets.
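For finite sets of real numbers the infimum in the definition of $d(A, B)$ can be computed directly, since the least $\delta$ with $A \subset B_\delta$ is the largest distance from a point of $A$ to its nearest point of $B$. The following is a minimal sketch under that restriction to finite subsets of $\mathbb{R}$; the function name `hausdorff_distance` is chosen here for illustration.

```python
def hausdorff_distance(A, B):
    """Hausdorff distance between two non-empty finite sets of real numbers."""

    def one_sided(X, Y):
        # Least delta such that the delta-neighbourhood of Y contains X.
        return max(min(abs(x - y) for y in Y) for x in X)

    return max(one_sided(A, B), one_sided(B, A))


if __name__ == "__main__":
    print(hausdorff_distance([0, 1], [0, 1, 3]))   # the point 3 is at distance 2 from {0, 1}
```

Both one-sided terms are needed: in the usage line the set $\{0, 1\}$ lies inside $\{0, 1, 3\}$, so one of them is zero, yet the two sets are far from equal.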

We give two proofs of the fundamental result on IFSs. The first depends on 
Banach's contraction mapping theorem, and the second is direct and elementary. 

Theorem 9.1

Consider the iterated function system given by the contractions $\{S_1, \dots, S_m\}$ on
$D \subset \mathbb{R}^n$, so that

\[ |S_i(x) - S_i(y)| \le c_i|x - y| \quad (x, y \in D) \tag{9.1} \]

with $c_i < 1$ for each $i$. Then there is a unique attractor $F$, i.e. a non-empty compact
set such that

\[ F = \bigcup_{i=1}^{m} S_i(F). \tag{9.2} \]

Moreover, if we define a transformation $S$ on the class $\mathcal{S}$ of non-empty compact
sets by

\[ S(E) = \bigcup_{i=1}^{m} S_i(E) \tag{9.3} \]

for $E \in \mathcal{S}$, and write $S^k$ for the $k$th iterate of $S$ (so $S^0(E) = E$ and
$S^k(E) = S(S^{k-1}(E))$ for $k \ge 1$), then

\[ F = \bigcap_{k=0}^{\infty} S^k(E) \tag{9.4} \]

for every set $E \in \mathcal{S}$ such that $S_i(E) \subset E$ for all $i$.

First proof. Note that sets in $\mathcal{S}$ are transformed by $S$ into other sets of $\mathcal{S}$. If
$A, B \in \mathcal{S}$ then

\[ d\Big(\bigcup_{i=1}^{m} S_i(A), \bigcup_{i=1}^{m} S_i(B)\Big) \le \max_{1 \le i \le m} d(S_i(A), S_i(B)) \]

using the definition of the metric $d$ and noting that if the $\delta$-neighbourhood
$(S_i(A))_\delta$ contains $S_i(B)$ for all $i$ then $(\bigcup_{i=1}^{m} S_i(A))_\delta$ contains $\bigcup_{i=1}^{m} S_i(B)$, and
vice versa. By (9.1)

\[ d(S(A), S(B)) \le \Big(\max_{1 \le i \le m} c_i\Big) d(A, B). \tag{9.5} \]

It may be shown that $d$ is a complete metric on $\mathcal{S}$, that is, every Cauchy sequence
of sets in $\mathcal{S}$ is convergent to a set in $\mathcal{S}$. Since $0 < \max_{1 \le i \le m} c_i < 1$, (9.5) states that
$S$ is a contraction on the complete metric space $(\mathcal{S}, d)$. By Banach's contraction
mapping theorem, $S$ has a unique fixed point, that is, there is a unique set $F \in \mathcal{S}$
such that $S(F) = F$, which is (9.2), and moreover $S^k(E) \to F$ as $k \to \infty$. In
particular, if $S_i(E) \subset E$ for all $i$ then $S(E) \subset E$, so that $S^k(E)$ is a decreasing
sequence of non-empty compact sets containing $F$ with intersection $\bigcap_{k=0}^{\infty} S^k(E)$
which must equal $F$. □

Second proof. Let $E$ be any set in $\mathcal{S}$ such that $S_i(E) \subset E$ for all $i$; for example
$E = D \cap B(0, r)$ will do provided $r$ is sufficiently large. Then $S^k(E) \subset S^{k-1}(E)$,
so that $S^k(E)$ is a decreasing sequence of non-empty compact sets, which
necessarily have non-empty compact intersection $F = \bigcap_{k=0}^{\infty} S^k(E)$. Since $S^k(E)$ is
a decreasing sequence of sets, it follows that $S(F) = F$, so $F$ satisfies (9.2) and
is an attractor of the IFS.

To see that the attractor is unique, we derive (9.5) exactly as in the first proof.
Suppose $A$ and $B$ are both attractors, so that $S(A) = A$ and $S(B) = B$. Since
$0 < \max_{1 \le i \le m} c_i < 1$ it follows from (9.5) that $d(A, B) = 0$, implying $A = B$. □

There are two main problems that arise in connection with iterated function
systems. The first problem is to represent or 'code' a given set as the attractor
of some IFS, and the second is to 'decode' the IFS by displaying its attractor.
In both cases, we may wish to go on to analyse the structure and dimensions of
the attractor, and the IFS can be a great aid in doing this.

Finding an IFS that has a given $F$ as its unique attractor can often be done
by inspection, at least if $F$ is self-similar or self-affine. For example, the Cantor
dust (figure 0.4) is easily seen to be the attractor of the four similarities which
give the basic self-similarities of the set:

\[ S_1(x, y) = (\tfrac{1}{4}x, \tfrac{1}{4}y + \tfrac{1}{2}), \qquad S_2(x, y) = (\tfrac{1}{4}x + \tfrac{1}{4}, \tfrac{1}{4}y), \]

\[ S_3(x, y) = (\tfrac{1}{4}x + \tfrac{1}{2}, \tfrac{1}{4}y + \tfrac{3}{4}), \qquad S_4(x, y) = (\tfrac{1}{4}x + \tfrac{3}{4}, \tfrac{1}{4}y + \tfrac{1}{4}). \]

In general it may not be possible to find an IFS with a given set as attractor, but
we can normally find one with an attractor that is a close approximation to the
required set. This question of representing general objects by IFSs is considered
in Section 9.5.

The transformation $S$ introduced in Theorem 9.1 is the key to computing the
attractor of an IFS; indeed (9.4) already provides a method for doing so. In
fact, the sequence of iterates $S^k(E)$ converges to the attractor $F$ for any initial
set $E$ in $\mathcal{S}$, in the sense that $d(S^k(E), F) \to 0$. This follows since (9.5)
implies that $d(S(E), F) = d(S(E), S(F)) \le c\,d(E, F)$, so that
$d(S^k(E), F) \le c^k d(E, F)$, where $c = \max_{1 \le i \le m} c_i < 1$. Thus the $S^k(E)$ provide increasingly
good approximations to $F$. If $F$ is a fractal, these approximations are sometimes
called pre-fractals for $F$.
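The geometric rate of convergence can be observed numerically. Since the attractor itself is not a finite set, the sketch below compares successive iterates instead: applying (9.5) repeatedly gives $d(S^k(E), S^{k+1}(E)) \le c^k d(E, S(E))$. It assumes the two Cantor set mappings $x \mapsto x/3$ and $x \mapsto x/3 + 2/3$ with $c = 1/3$, a one-point initial set, and exact rational arithmetic; all function names are hypothetical.

```python
from fractions import Fraction

# The two Cantor set contractions, each of ratio 1/3.
MAPS = [lambda x: x / 3, lambda x: x / 3 + Fraction(2, 3)]


def apply_ifs(E):
    """S(E): the union of the images S_i(E) of a finite set E."""
    return sorted({S(x) for S in MAPS for x in E})


def hausdorff_distance(A, B):
    """Hausdorff distance between two non-empty finite sets of numbers."""
    def one_sided(X, Y):
        return max(min(abs(x - y) for y in Y) for x in X)
    return max(one_sided(A, B), one_sided(B, A))


def successive_distances(E, steps):
    """The distances d(S^k(E), S^(k+1)(E)) for k = 0, ..., steps - 1."""
    out = []
    for _ in range(steps):
        nxt = apply_ifs(E)
        out.append(hausdorff_distance(E, nxt))
        E = nxt
    return out


if __name__ == "__main__":
    for k, dist in enumerate(successive_distances([Fraction(5)], 6)):
        print(k, dist, Fraction(1, 3) ** k * Fraction(10, 3))   # distance and its bound
```

The initial point 5 lies outside the attractor on purpose, to show that the starting set need not be related to $F$.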

For each $k$

\[ S^k(E) = \bigcup_{\mathcal{I}_k} S_{i_1} \circ \cdots \circ S_{i_k}(E) = \bigcup_{\mathcal{I}_k} S_{i_1}(S_{i_2}(\cdots(S_{i_k}(E))\cdots)) \tag{9.6} \]

where the union is over the set $\mathcal{I}_k$ of all $k$-term sequences $(i_1, \dots, i_k)$ with
$1 \le i_j \le m$; see figure 9.2. (Recall that $S_{i_1} \circ \cdots \circ S_{i_k}$ denotes the composition of
mappings, so that $(S_{i_1} \circ \cdots \circ S_{i_k})(x) = S_{i_1}(S_{i_2}(\cdots(S_{i_k}(x))\cdots))$.) If $S_i(E)$ is
contained in $E$ for each $i$ and $x$ is a point of $F$, it follows from (9.4) that there is
a (not necessarily unique) sequence $(i_1, i_2, \dots)$ such that $x \in S_{i_1} \circ \cdots \circ S_{i_k}(E)$ for
all $k$. This sequence provides a natural coding for $x$, with

\[ x = x_{i_1, i_2, \dots} = \bigcap_{k=1}^{\infty} S_{i_1} \circ \cdots \circ S_{i_k}(E), \tag{9.7} \]

so that $F = \bigcup \{x_{i_1, i_2, \dots}\}$.

This expression for $x_{i_1, i_2, \dots}$ is independent of $E$ provided that $S_i(E)$ is contained
in $E$ for all $i$.

Figure 9.2 Construction of the attractor $F$ for contractions $S_1$ and $S_2$ which map the large
ellipse $E$ onto the ellipses $S_1(E)$ and $S_2(E)$. The sets $S^k(E) = \bigcup_{i_j = 1, 2} S_{i_1} \circ \cdots \circ S_{i_k}(E)$
give increasingly good approximations to $F$



Notice that if the union in (9.2) is disjoint then $F$ must be totally disconnected
(provided the $S_i$ are injections), since if $x_{i_1, i_2, \dots} \ne x_{i'_1, i'_2, \dots}$ we may find $k$ such
that $(i_1, \dots, i_k) \ne (i'_1, \dots, i'_k)$ so that the disjoint closed sets $S_{i_1} \circ \cdots \circ S_{i_k}(F)$ and
$S_{i'_1} \circ \cdots \circ S_{i'_k}(F)$ disconnect the two points.

Again this may be illustrated by $S_1(x) = \frac{1}{3}x$, $S_2(x) = \frac{1}{3}x + \frac{2}{3}$ and $F$ the Cantor
set. If $E = [0, 1]$ then $S^k(E) = E_k$, the set of $2^k$ basic intervals of length $3^{-k}$
obtained at the $k$th stage of the usual Cantor set construction; see figure 0.1.
Moreover, $x_{i_1, i_2, \dots}$ is the point of the Cantor set with base-3 expansion $0 \cdot a_1 a_2 \cdots$,
where $a_k = 0$ if $i_k = 1$ and $a_k = 2$ if $i_k = 2$. The pre-fractals $S^k(E)$ provide
the usual construction of many fractals for a suitably chosen initial set $E$; the
$S_{i_1} \circ \cdots \circ S_{i_k}(E)$ are called the level-$k$ sets of the construction.
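The level-$k$ sets of the Cantor set example, and their coding by sequences, can be listed explicitly. The sketch below represents a closed interval as a pair of exact fractions and composes the mappings in the order $S_{i_1} \circ \cdots \circ S_{i_k}$, so the innermost map $S_{i_k}$ is applied first. The names `S` and `level_sets` and the pair representation are assumptions for illustration.

```python
from fractions import Fraction
from itertools import product


def S(i, interval):
    """Apply S_1(x) = x/3 (i = 1) or S_2(x) = x/3 + 2/3 (i = 2) to an interval (lo, hi)."""
    shift = Fraction(0) if i == 1 else Fraction(2, 3)
    lo, hi = interval
    return (lo / 3 + shift, hi / 3 + shift)


def level_sets(k, E=(Fraction(0), Fraction(1))):
    """Map each k-term sequence (i_1, ..., i_k) to the interval S_{i_1} o ... o S_{i_k}(E)."""
    sets = {}
    for seq in product((1, 2), repeat=k):
        interval = E
        for i in reversed(seq):          # the innermost map S_{i_k} acts first
            interval = S(i, interval)
        sets[seq] = interval
    return sets


if __name__ == "__main__":
    for seq, (lo, hi) in sorted(level_sets(2).items()):
        print(seq, lo, hi)
```

The left endpoint of the interval coded by $(i_1, \dots, i_k)$ is the number with base-3 digits $a_j = 0$ or 2 according as $i_j = 1$ or 2, which is the finite form of the coding described above.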

This theory provides us with two methods for computer drawing of IFS attractors
in the plane, as indicated in figure 9.3. For the first method, take any initial
set $E$ (such as a square) and draw the $k$th approximation $S^k(E)$ to $F$ given by
(9.6) for a suitable value of $k$. The set $S^k(E)$ is made up of $m^k$ small sets; either
these can be drawn in full, or a representative point of each can be plotted. If
$E$ can be chosen as a line segment in such a way that $S_1(E), \dots, S_m(E)$ join
up to form a polygonal curve with endpoints the same as those of $E$, then the
sequence of polygonal curves $S^k(E)$ provides increasingly good approximations
to the fractal curve $F$. Taking $E$ as the initial interval in the von Koch curve
construction is an example of this, with $S^k(E)$ just the $k$th step of the construction
($E_k$ in figure 0.2). Careful recursive programming is helpful when using
this method.

Figure 9.3 Two ways of computer drawing the attractor $F$ of the IFS consisting of the
three affine transformations $S_1$, $S_2$ and $S_3$ which map the square onto the rectangles.
In method (a) the $3^k$ parallelograms $S_{i_1}(S_{i_2}(\cdots(S_{i_k}(E))\cdots))$ for $i_j = 1, 2, 3$ are drawn
($k = 6$ here). In method (b) the sequence of points $x_k$ is plotted by choosing $S_{i_k}$ at random
from $S_1$, $S_2$ and $S_3$ for successive $k$ and letting $x_k = S_{i_k}(x_{k-1})$
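The first method, in its 'representative point' form, can be sketched recursively: the points of $S^k(\{p\})$ are obtained by applying every mapping to every point of $S^{k-1}(\{p\})$. The three mappings used below are the half-scale similarities towards the vertices of an equilateral triangle, chosen here purely as a familiar illustration rather than the affine maps of figure 9.3; the names `VERTICES`, `MAPS` and `prefractal_points` are hypothetical, and plotting is left to whatever graphics library is at hand.

```python
import math

# Hypothetical example IFS: three similarities of ratio 1/2 fixing the
# vertices of an equilateral triangle of side 1.
VERTICES = [(0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2)]
MAPS = [lambda p, v=v: ((p[0] + v[0]) / 2, (p[1] + v[1]) / 2) for v in VERTICES]


def prefractal_points(point, k, maps=MAPS):
    """One representative point in each of the m**k level-k sets."""
    if k == 0:
        return [point]
    # S^k({p}) = S(S^(k-1)({p})): apply every map to every earlier point.
    return [S(q) for q in prefractal_points(point, k - 1, maps) for S in maps]


if __name__ == "__main__":
    pts = prefractal_points((0.2, 0.2), 6)
    print(len(pts))                      # m**k points, here 3**6
```

The number of points grows as $m^k$, so in practice $k$ is limited by the resolution of the display rather than by accuracy.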



For the second method, take $x_0$ as any initial point, select a contraction $S_{i_1}$
from $S_1, \dots, S_m$ at random, and let $x_1 = S_{i_1}(x_0)$. Continue in this way, choosing
$S_{i_k}$ from $S_1, \dots, S_m$ at random (with equal probability, say) and letting
$x_k = S_{i_k}(x_{k-1})$ for $k = 1, 2, \dots$. For large enough $k$, the points $x_k$ will be
indistinguishably close to $F$, with $x_k$ close to $S_{i_k} \circ \cdots \circ S_{i_1}(F)$, so the sequence $\{x_k\}$
will appear randomly distributed across $F$. A plot of the sequence $\{x_k\}$ from,
say, the hundredth term onwards may give a good impression of $F$. (It is a
consequence of ergodic theory that, with probability 1, this sequence of points will
fill $F$, in a manner that approximates a certain measure on $F$.)
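The second method takes only a few lines. The sketch below follows the description above: it iterates a randomly chosen mapping and discards an initial run of points (the first hundred by default). The function name `chaos_game`, the `burn_in` and `seed` parameters, and the use of the two Cantor set mappings as a test case are assumptions for illustration.

```python
import random

# Example IFS: the two Cantor set contractions on the real line.
CANTOR_MAPS = [lambda x: x / 3, lambda x: x / 3 + 2 / 3]


def chaos_game(maps, x0, n_points, burn_in=100, seed=0):
    """Return x_k = S_{i_k}(x_{k-1}) for burn_in < k <= burn_in + n_points.

    Each S_{i_k} is chosen uniformly at random from maps.
    """
    rng = random.Random(seed)            # seeded so that runs are reproducible
    x = x0
    points = []
    for k in range(burn_in + n_points):
        x = rng.choice(maps)(x)
        if k >= burn_in:                 # early points may still be far from F
            points.append(x)
    return points


if __name__ == "__main__":
    pts = chaos_game(CANTOR_MAPS, x0=10.0, n_points=5)
    print(pts)
```

The same function works unchanged for planar mappings acting on coordinate pairs, since it only requires each mapping to accept and return a point.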



9.2 Dimensions of self-similar sets 

One of the advantages of using an iterated function system is that the dimension of
the attractor is often relatively easy to calculate or estimate in terms of the defining
contractions. In this section we discuss the case where $S_1, \dots, S_m : \mathbb{R}^n \to \mathbb{R}^n$ are
similarities, i.e. with

\[ |S_i(x) - S_i(y)| = c_i|x - y| \quad (x, y \in \mathbb{R}^n) \tag{9.8} \]

where $0 < c_i < 1$ ($c_i$ is called the ratio of $S_i$). Thus each $S_i$ transforms subsets
of $\mathbb{R}^n$ into geometrically similar sets. The attractor of such a collection of
similarities is called a (strictly) self-similar set, being a union of a number of
smaller similar copies of itself. Standard examples include the middle third Cantor
set, the Sierpiński triangle and the von Koch curve, see figures 0.1-0.5. We
show that, under certain conditions, a self-similar set $F$ has Hausdorff and box
dimensions equal to the value of $s$ satisfying

\[ \sum_{i=1}^{m} c_i^s = 1 \tag{9.9} \]

and further that $F$ has positive and finite $\mathcal{H}^s$-measure. A calculation similar to the
'heuristic calculation' of Example 2.7 indicates that the value given by (9.9) is at
least plausible. If $F = \bigcup_{i=1}^{m} S_i(F)$ with the union 'nearly disjoint', we have that

\[ \mathcal{H}^s(F) = \sum_{i=1}^{m} \mathcal{H}^s(S_i(F)) = \sum_{i=1}^{m} c_i^s \mathcal{H}^s(F) \tag{9.10} \]

using (9.8) and Scaling property 2.1. On the assumption that $0 < \mathcal{H}^s(F) < \infty$ at
the 'jump' value $s = \dim_H F$, we get that $s$ satisfies (9.9).

For this argument to give the right answer, we require a condition that ensures
that the components $S_i(F)$ of $F$ do not overlap 'too much'. We say that the $S_i$
satisfy the open set condition if there exists a non-empty bounded open set $V$
such that

\[ V \supset \bigcup_{i=1}^{m} S_i(V) \tag{9.11} \]

with the union disjoint. (In the middle third Cantor set example, the open set
condition holds for $S_1$ and $S_2$ with $V$ as the open interval $(0, 1)$.) We show that,
provided that the similarities $S_i$ satisfy the open set condition, the Hausdorff
dimension of the attractor is given by (9.9).

We require the following geometrical result.

Lemma 9.2

Let $\{V_i\}$ be a collection of disjoint open subsets of $\mathbb{R}^n$ such that each $V_i$ contains
a ball of radius $a_1 r$ and is contained in a ball of radius $a_2 r$. Then any ball $B$ of
radius $r$ intersects at most $(1 + 2a_2)^n a_1^{-n}$ of the closures $\overline{V}_i$.

Proof. If $\overline{V}_i$ meets $B$, then $\overline{V}_i$ is contained in the ball concentric with $B$ of
radius $(1 + 2a_2)r$. Suppose that $q$ of the sets $\overline{V}_i$ intersect $B$. Then, summing the
volumes of the corresponding interior balls of radii $a_1 r$, it follows that
$q(a_1 r)^n \le (1 + 2a_2)^n r^n$, giving the stated bound for $q$. □

The derivation of the lower bound in the following theorem is a little awkward.
The reader may find it helpful to follow through the proof with the
middle third Cantor set in mind, or by referring to the 'general example' of
figure 9.2. Alternatively, the proof of Proposition 9.7 covers the case when the
sets $S_1(F), \dots, S_m(F)$ are disjoint, and is rather simpler.



Theorem 9.3

Suppose that the open set condition (9.11) holds for the similarities $S_i$ on $\mathbb{R}^n$ with
ratios $0 < c_i < 1$ for $1 \le i \le m$. If $F$ is the attractor of the IFS $\{S_1, \dots, S_m\}$,
that is

\[ F = \bigcup_{i=1}^{m} S_i(F), \tag{9.12} \]

then $\dim_H F = \dim_B F = s$, where $s$ is given by

\[ \sum_{i=1}^{m} c_i^s = 1. \tag{9.13} \]

Moreover, for this value of $s$, $0 < \mathcal{H}^s(F) < \infty$.

Proof. Let $s$ satisfy (9.13). Let $\mathcal{I}_k$ denote the set of all $k$-term sequences $(i_1, \dots, i_k)$
with $1 \le i_j \le m$. For any set $A$ and $(i_1, \dots, i_k) \in \mathcal{I}_k$ we write
$A_{i_1, \dots, i_k} = S_{i_1} \circ \cdots \circ S_{i_k}(A)$. It follows, by using (9.12) repeatedly, that

\[ F = \bigcup_{\mathcal{I}_k} F_{i_1, \dots, i_k}. \]

We check that these covers of $F$ provide a suitable upper estimate for the Hausdorff
measure. Since the mapping $S_{i_1} \circ \cdots \circ S_{i_k}$ is a similarity of ratio $c_{i_1} \cdots c_{i_k}$,
then

\[ \sum_{\mathcal{I}_k} |F_{i_1, \dots, i_k}|^s = \sum_{\mathcal{I}_k} (c_{i_1} \cdots c_{i_k})^s |F|^s = \Big(\sum_{i_1} c_{i_1}^s\Big) \cdots \Big(\sum_{i_k} c_{i_k}^s\Big) |F|^s = |F|^s \tag{9.14} \]

by (9.13). For any $\delta > 0$, we may choose $k$ such that $|F_{i_1, \dots, i_k}| \le (\max_i c_i)^k |F| \le \delta$,
so $\mathcal{H}^s_\delta(F) \le |F|^s$ and hence $\mathcal{H}^s(F) \le |F|^s$.

The lower bound is more awkward. Let $\mathcal{I}$ be the set of all infinite sequences
$\mathcal{I} = \{(i_1, i_2, \dots) : 1 \le i_j \le m\}$, and let $I_{i_1, \dots, i_k} = \{(i_1, \dots, i_k, q_{k+1}, \dots) : 1 \le q_j \le m\}$
be the 'cylinder' consisting of those sequences in $\mathcal{I}$ with initial terms $(i_1, \dots, i_k)$.
We may put a mass distribution $\mu$ on $\mathcal{I}$ such that $\mu(I_{i_1, \dots, i_k}) = (c_{i_1} \cdots c_{i_k})^s$. Since
$(c_{i_1} \cdots c_{i_k})^s = \sum_{i=1}^{m} (c_{i_1} \cdots c_{i_k} c_i)^s$, i.e. $\mu(I_{i_1, \dots, i_k}) = \sum_{i=1}^{m} \mu(I_{i_1, \dots, i_k, i})$, it follows
that $\mu$ is indeed a mass distribution on subsets of $\mathcal{I}$ with $\mu(\mathcal{I}) = 1$. We
may transfer $\mu$ to a mass distribution $\tilde{\mu}$ on $F$ in a natural way by defining
$\tilde{\mu}(A) = \mu\{(i_1, i_2, \dots) : x_{i_1, i_2, \dots} \in A\}$ for subsets $A$ of $F$. (Recall that
$x_{i_1, i_2, \dots} = \bigcap_{k=1}^{\infty} F_{i_1, \dots, i_k}$.) Thus the $\tilde{\mu}$-mass of a set is the $\mu$-mass of the corresponding
sequences. It is easily checked that $\tilde{\mu}(F) = 1$.
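The two facts used about the masses of cylinders, namely that the masses of all level-$k$ cylinders sum to 1 and that each cylinder's mass is the sum of the masses of its $m$ sub-cylinders, can be checked numerically for a particular pair of ratios. The ratios $c_1 = \frac{1}{2}$, $c_2 = \frac{1}{4}$ below are a hypothetical choice for which (9.13) has the closed-form solution $s = \log((1 + \sqrt{5})/2)/\log 2$; the names `RATIOS`, `S_DIM`, `cylinder_mass` and `total_mass` are likewise assumptions.

```python
import math
from itertools import product

RATIOS = (0.5, 0.25)                                      # hypothetical ratios c_1, c_2
S_DIM = math.log((1 + math.sqrt(5)) / 2) / math.log(2)    # solves 0.5**s + 0.25**s = 1


def cylinder_mass(seq, ratios=RATIOS, s=S_DIM):
    """Mass (c_{i_1} ... c_{i_k})**s of the cylinder with 1-based initial terms seq."""
    return math.prod(ratios[i - 1] for i in seq) ** s


def total_mass(k, ratios=RATIOS, s=S_DIM):
    """Sum of the masses of all m**k cylinders of level k."""
    indices = range(1, len(ratios) + 1)
    return sum(cylinder_mass(seq, ratios, s) for seq in product(indices, repeat=k))


if __name__ == "__main__":
    for k in range(5):
        print(k, total_mass(k))          # each value should be 1 up to rounding
```

The sum over level-$k$ sequences is the same computation as in (9.14), with the factor $|F|^s$ omitted.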

We show that $\tilde{\mu}$ satisfies the conditions of the Mass distribution principle 4.2.
Let $V$ be the open set of (9.11). Since $\overline{V} \supset S(\overline{V}) = \bigcup_{i=1}^{m} S_i(\overline{V})$, the decreasing
sequence of iterates $S^k(\overline{V})$ converges to $F$; see (9.4). In particular $\overline{V} \supset F$
and $\overline{V}_{i_1, \dots, i_k} \supset F_{i_1, \dots, i_k}$ for each finite sequence $(i_1, \dots, i_k)$. Let $B$ be any ball of
radius $r < 1$. We estimate $\tilde{\mu}(B)$ by considering the sets $V_{i_1, \dots, i_k}$ with diameters
comparable with that of $B$ and with closures intersecting $F \cap B$.

We curtail each infinite sequence $(i_1, i_2, \dots) \in \mathcal{I}$ after the first term $i_k$ for which

\[ \Big(\min_{1 \le i \le m} c_i\Big) r \le c_{i_1} c_{i_2} \cdots c_{i_k} \le r \tag{9.15} \]

and let $Q$ denote the finite set of all (finite) sequences obtained in this way.
Then for every infinite sequence $(i_1, i_2, \dots) \in \mathcal{I}$ there is exactly one value of $k$
with $(i_1, \dots, i_k) \in Q$. Since $V_1, \dots, V_m$ are disjoint, so are $V_{i_1, \dots, i_k, 1}, \dots, V_{i_1, \dots, i_k, m}$
for each $(i_1, \dots, i_k)$. Using this in a nested way, it follows that the collection
of open sets $\{V_{i_1, \dots, i_k} : (i_1, \dots, i_k) \in Q\}$ is disjoint. Similarly
$F \subset \bigcup_Q F_{i_1, \dots, i_k} \subset \bigcup_Q \overline{V}_{i_1, \dots, i_k}$.

We choose $a_1$ and $a_2$ so that $V$ contains a ball of radius $a_1$ and is contained in
a ball of radius $a_2$. Then, for all $(i_1, \dots, i_k) \in Q$, the set $V_{i_1, \dots, i_k}$ contains a ball
of radius $c_{i_1} \cdots c_{i_k} a_1$ and therefore one of radius $(\min_i c_i) a_1 r$, and is contained
in a ball of radius $c_{i_1} \cdots c_{i_k} a_2$ and hence in a ball of radius $a_2 r$. Let $Q_1$ denote
those sequences $(i_1, \dots, i_k)$ in $Q$ such that $B$ intersects $\overline{V}_{i_1, \dots, i_k}$. By Lemma 9.2
there are at most $q = (1 + 2a_2)^n a_1^{-n} (\min_i c_i)^{-n}$ sequences in $Q_1$. Then

\[ \tilde{\mu}(B) = \tilde{\mu}(F \cap B) = \mu\{(i_1, i_2, \dots) : x_{i_1, i_2, \dots} \in F \cap B\} \le \mu\Big(\bigcup_{Q_1} I_{i_1, \dots, i_k}\Big) \]

since, if $x_{i_1, i_2, \dots} \in F \cap B \subset \bigcup_Q \overline{V}_{i_1, \dots, i_k}$, then there is an integer $k$ such that
$(i_1, \dots, i_k) \in Q_1$. Thus

\[ \tilde{\mu}(B) \le \sum_{Q_1} \mu(I_{i_1, \dots, i_k}) = \sum_{Q_1} (c_{i_1} \cdots c_{i_k})^s \le \sum_{Q_1} r^s \le r^s q \]

using (9.15). Since any set $U$ is contained in a ball of radius $|U|$, we have
$\tilde{\mu}(U) \le |U|^s q$, so the Mass distribution principle 4.2 gives $\mathcal{H}^s(F) \ge q^{-1} > 0$,
and $\dim_H F = s$.

If $Q$ is any set of finite sequences such that for every $(i_1, i_2, \dots) \in \mathcal{I}$ there
is exactly one integer $k$ with $(i_1, \dots, i_k) \in Q$, it follows inductively from (9.13)
that $\sum_Q (c_{i_1} c_{i_2} \cdots c_{i_k})^s = 1$. Thus, if $Q$ is chosen as in (9.15), $Q$ contains at
most $(\min_i c_i)^{-s} r^{-s}$ sequences. For each sequence $(i_1, \dots, i_k) \in Q$ we have
$|\overline{V}_{i_1, \dots, i_k}| = c_{i_1} \cdots c_{i_k} |\overline{V}| \le r|\overline{V}|$, so $F$ may be covered by $(\min_i c_i)^{-s} r^{-s}$ sets
of diameter $r|\overline{V}|$ for each $r < 1$. It follows from Equivalent definition 3.1(iv)
that $\overline{\dim}_B F \le s$; noting that $s = \dim_H F \le \underline{\dim}_B F \le \overline{\dim}_B F \le s$, using (3.17),
completes the proof. □
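The curtailing step (9.15) is a simple tree search: a finite sequence is extended until the product of its ratios first drops to $r$ or below. The sketch below builds the set $Q$ this way and allows the two counting facts used in the proof to be checked, namely that the $s$-th powers of the products sum to 1 and that $Q$ has at most $(\min_i c_i)^{-s} r^{-s}$ members. The ratios, the closed-form value of $s$ and the function name `curtailed_sequences` are assumptions for illustration.

```python
import math

RATIOS = (0.5, 0.25)                                      # hypothetical ratios c_1, c_2
S_DIM = math.log((1 + math.sqrt(5)) / 2) / math.log(2)    # solves 0.5**s + 0.25**s = 1


def curtailed_sequences(ratios, r):
    """Pairs (sequence, product), cut at the first k with c_{i_1}...c_{i_k} <= r (0 < r < 1)."""
    Q = []
    stack = [((), 1.0)]                  # the empty sequence has product 1 > r
    while stack:
        seq, prod = stack.pop()
        for i, c in enumerate(ratios, start=1):
            new_seq, new_prod = seq + (i,), prod * c
            if new_prod <= r:
                Q.append((new_seq, new_prod))         # first term at which (9.15) holds
            else:
                stack.append((new_seq, new_prod))     # still too large: extend further
    return Q


if __name__ == "__main__":
    r = 0.01
    Q = curtailed_sequences(RATIOS, r)
    print(len(Q), sum(p ** S_DIM for _, p in Q), (min(RATIOS) * r) ** (-S_DIM))
```

Because the ratios differ, the sequences in $Q$ have different lengths, which is why the proof works with $Q$ rather than with all sequences of one fixed length.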

If the open set condition is not assumed in Theorem 9.3, it may be shown that
we still have $\dim_H F = \dim_B F$ though this value may be less than $s$.

Theorem 9.3 enables us to find the dimension of many self-similar fractals. 
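When the ratios are not all equal, the equation $\sum_{i=1}^{m} c_i^s = 1$ of Theorem 9.3 generally has no closed-form solution, but its left-hand side is strictly decreasing in $s$, so a bisection search finds the root. The following is a minimal sketch; the function name `similarity_dimension` and the tolerance are assumptions, and the value returned is the dimension only when the open set condition holds.

```python
import math


def similarity_dimension(ratios, tol=1e-12):
    """Solve sum(c**s for c in ratios) = 1 for s by bisection (needs 0 < c < 1, m >= 2)."""
    def f(s):
        return sum(c ** s for c in ratios) - 1.0

    lo, hi = 0.0, 1.0                    # f(0) = m - 1 > 0
    while f(hi) > 0:                     # f is strictly decreasing, so expand until f(hi) <= 0
        hi *= 2
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


if __name__ == "__main__":
    print(similarity_dimension([1 / 3, 1 / 3]))              # middle third Cantor set
    print(similarity_dimension([1 / 2, 1 / 2, 1 / 2]))       # three similarities of ratio 1/2
```

With equal ratios $c$ the root is $\log m / \log(1/c)$, which gives a quick check on the output.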

Example 9.4. Sierpiński triangle

The Sierpiński triangle or gasket $F$ is constructed from an equilateral triangle by
repeatedly removing inverted equilateral triangles; see figure 0.3. Then
$\dim_H F = \dim_B F = \log 3/\log 2$.

Calculation. The set $F$ is the attractor of the three obvious similarities of ratio
$\tfrac12$ which map the triangle $E_0$ onto the triangles of $E_1$. The open set condition
holds, taking $V$ as the interior of $E_0$. Thus, by Theorem 9.3, $\dim_H F = \dim_B F =
\log 3/\log 2$, which is the solution of $3(\tfrac12)^s = \sum_{i=1}^{3} (\tfrac12)^s = 1$. □
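When the ratios differ, the equation $\sum_i c_i^s = 1$ of Theorem 9.3 generally has no closed-form solution, but the left-hand side is strictly decreasing in $s$, so bisection finds the root. The following is a minimal sketch; the function name `similarity_dimension` and the tolerance are illustrative choices, not part of the text.

```python
import math

def similarity_dimension(ratios, tol=1e-12):
    """Solve sum(c**s for c in ratios) == 1 for s by bisection.

    Assumes every ratio satisfies 0 < c < 1, so the sum is strictly
    decreasing in s and the root is unique.
    """
    if not ratios or any(not (0.0 < c < 1.0) for c in ratios):
        raise ValueError("ratios must lie strictly between 0 and 1")
    f = lambda s: sum(c ** s for c in ratios) - 1.0
    lo, hi = 0.0, 1.0
    while f(hi) > 0.0:      # enlarge the bracket until the sum drops below 1
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Three similarities of ratio 1/2 (Sierpinski triangle, Example 9.4).
sierpinski = similarity_dimension([0.5, 0.5, 0.5])
# Mixed ratios 1/3, 1/3, 1/3, 1/4, 1/4 (the generator of figure 9.5).
mixed = similarity_dimension([1 / 3, 1 / 3, 1 / 3, 1 / 4, 1 / 4])
```

The value returned is only the dimension when the open set condition holds; otherwise it is an upper bound, as Proposition 9.6 indicates.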

The next example involves similarity transformations of more than one ratio. 




Figure 9.4 Construction of a modified von Koch curve; see Example 9.5

Example 9.5. Modified von Koch curve

Fix $0 < a \le \tfrac13$ and construct a curve $F$ by repeatedly replacing the middle
proportion $a$ of each interval by the other two sides of an equilateral triangle; see
figure 9.4. Then $\dim_H F = \dim_B F$ is the solution of $2a^s + 2(\tfrac12(1 - a))^s = 1$.

Calculation. The curve $F$ is the attractor of the similarities that map the unit
interval onto each of the four intervals in $E_1$. The open set condition holds,
taking $V$ as the interior of the isosceles triangle of base length 1 and height
$\tfrac12 a\sqrt{3}$, so Theorem 9.3 gives the dimension stated. □

Figure 9.5 Stages in the construction of a fractal curve from a generator. The lengths of
the segments in the generator are $\tfrac13, \tfrac14, \tfrac13, \tfrac14, \tfrac13$, and the Hausdorff and box dimensions of
$F$ are given by $3(\tfrac13)^s + 2(\tfrac14)^s = 1$ or $s = 1.34\ldots$

Figure 9.6 A fractal curve and its generator. The Hausdorff and box dimensions of the
curve are equal to $\log 8/\log 4 = 1\tfrac12$

Figure 9.7 A tree-like fractal and its generator. The Hausdorff and box dimensions are
equal to $\log 5/\log 3 = 1.465\ldots$

There is a convenient method of specifying certain self-similar sets diagram-
matically, in particular self-similar curves such as Example 9.5. A generator
consists of a number of straight line segments and two points specially identi-
fied. We associate with each line segment the similarity that maps the two special
points onto the endpoints of the segment. A sequence of sets approximating to the
self-similar attractor may be built up by iterating the process of replacing each
line segment by a similar copy of the generator; see figures 9.5-9.7 for some
examples. Note that the similarities are defined by the generator only to within
reflection and 180° rotation but the orientation may be specified by displaying
the first step of the construction.
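The replacement process can be sketched in a few lines by treating the plane as the complex numbers. The sketch below assumes the generator is a single polyline from 0 to 1, that the two special points are 0 and 1, and that every segment is replaced by an orientation-preserving copy (no reflection); these are simplifying assumptions, and the function names are illustrative.

```python
import math

def iterate_generator(generator, steps):
    """Replace every segment of a polyline by a scaled copy of the generator.

    `generator` is a list of complex numbers tracing a polyline from 0 to 1;
    the two special points are taken to be 0 and 1 (an assumption of this sketch).
    """
    points = [0j, 1 + 0j]
    for _ in range(steps):
        refined = [points[0]]
        for z0, z1 in zip(points, points[1:]):
            # z -> z0 + (z1 - z0) * z is the similarity sending 0 -> z0, 1 -> z1
            refined.extend(z0 + (z1 - z0) * g for g in generator[1:])
        points = refined
    return points

def modified_koch_generator(a):
    """Generator of Example 9.5 for a middle proportion 0 < a <= 1/3."""
    apex = 0.5 + 1j * a * math.sqrt(3) / 2
    return [0j, (1 - a) / 2 + 0j, apex, (1 + a) / 2 + 0j, 1 + 0j]

# Third stage of the construction for a = 1/3 (the usual von Koch curve).
curve = iterate_generator(modified_koch_generator(1 / 3), 3)
length = sum(abs(q - p) for p, q in zip(curve, curve[1:]))
```

Each step multiplies the total length by $1 + a$, which gives a quick check that the generator has been entered correctly.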



9.3 Some variations 

The calculations underlying Theorem 9.3 may be adapted to estimate the dimen- 
sion of the attractor F of an IFS consisting of contractions that are not similarities. 

Proposition 9.6 

Let $F$ be the attractor of an IFS consisting of contractions $\{S_1, \ldots, S_m\}$ on a
closed subset $D$ of $\mathbb{R}^n$ such that

$$|S_i(x) - S_i(y)| \le c_i|x - y| \quad (x, y \in D)$$

with $0 < c_i < 1$ for each $i$. Then $\dim_H F \le s$ and $\overline{\dim}_B F \le s$, where $\sum_{i=1}^m c_i^s = 1$.

Proof. These estimates are essentially those of the first and last paragraphs of the
proof of Theorem 9.3, noting that we have the inequality $|A_{i_1,\ldots,i_k}| \le c_{i_1} \cdots c_{i_k}|A|$
for each set $A$, rather than equality. □

We next obtain a lower bound for dimension in the case where the components
$S_i(F)$ of $F$ are disjoint. Note that this will certainly be the case if there is some
non-empty compact set $E$ with $S_i(E) \subset E$ for all $i$ and with the $S_i(E)$ disjoint.

Proposition 9.7 

Consider the IFS consisting of contractions $\{S_1, \ldots, S_m\}$ on a closed subset $D$ of
$\mathbb{R}^n$ such that

$$b_i|x - y| \le |S_i(x) - S_i(y)| \quad (x, y \in D) \qquad (9.16)$$

with $0 < b_i < 1$ for each $i$. Assume that the (non-empty compact) attractor
$F$ satisfies

$$F = \bigcup_{i=1}^m S_i(F), \qquad (9.17)$$

with this union disjoint. Then $F$ is totally disconnected and $\dim_H F \ge s$ where

$$\sum_{i=1}^m b_i^s = 1. \qquad (9.18)$$



Proof. Let $d > 0$ be the minimum distance between any pair of the disjoint com-
pact sets $S_1(F), \ldots, S_m(F)$, i.e. $d = \min_{i \ne j} \inf\{|x - y| : x \in S_i(F), y \in S_j(F)\}$.
Let $F_{i_1,\ldots,i_k} = S_{i_1} \circ \cdots \circ S_{i_k}(F)$ and define $\mu$ by $\mu(F_{i_1,\ldots,i_k}) = (b_{i_1} \cdots b_{i_k})^s$. Since

$$\sum_{i=1}^m \mu(F_{i_1,\ldots,i_k,i}) = \sum_{i=1}^m (b_{i_1} \cdots b_{i_k} b_i)^s = (b_{i_1} \cdots b_{i_k})^s = \mu(F_{i_1,\ldots,i_k}) = \mu\Big(\bigcup_{i=1}^m F_{i_1,\ldots,i_k,i}\Big)$$

it follows that $\mu$ defines a mass distribution on $F$ with $\mu(F) = 1$.

If $x \in F$, there is a unique infinite sequence $i_1, i_2, \ldots$ such that $x \in F_{i_1,\ldots,i_k}$ for
each $k$. For $0 < r < d$ let $k$ be the least integer such that

$$b_{i_1} \cdots b_{i_k} d \le r < b_{i_1} \cdots b_{i_{k-1}} d.$$

If $i'_1, \ldots, i'_k$ is distinct from $i_1, \ldots, i_k$, the sets $F_{i_1,\ldots,i_k}$ and $F_{i'_1,\ldots,i'_k}$ are disjoint and
separated by a gap of at least $b_{i_1} \cdots b_{i_{k-1}} d > r$. (To see this, note that if $j$ is the
least integer such that $i_j \ne i'_j$ then $F_{i_j,\ldots,i_k} \subset F_{i_j}$ and $F_{i'_j,\ldots,i'_k} \subset F_{i'_j}$ are separated
by $d$, so $F_{i_1,\ldots,i_k}$ and $F_{i'_1,\ldots,i'_k}$ are separated by at least $b_{i_1} \cdots b_{i_{j-1}} d$.) It follows that
$F \cap B(x, r) \subset F_{i_1,\ldots,i_k}$, so

$$\mu(F \cap B(x, r)) \le \mu(F_{i_1,\ldots,i_k}) = (b_{i_1} \cdots b_{i_k})^s \le d^{-s} r^s.$$

If $U$ intersects $F$, then $U \subset B(x, r)$ for some $x \in F$ with $r = |U|$. Thus $\mu(U) \le
d^{-s}|U|^s$, so by the Mass distribution principle 4.2 $\mathcal{H}^s(F) > 0$ and $\dim_H F \ge s$.
The separation indicated above implies that $F$ is totally disconnected. □



Example 9.8. 'Non-linear' Cantor set

Let $D = [\tfrac12(1 + \sqrt{3}), (1 + \sqrt{3})]$ and let $S_1, S_2 : D \to D$ be given by
$S_1(x) = 1 + 1/x$, $S_2(x) = 2 + 1/x$. Then $0.44 < \dim_H F \le \underline{\dim}_B F \le \overline{\dim}_B F <
0.66$ where $F$ is the attractor of $\{S_1, S_2\}$. (This example arises in connection with
number theory; see Section 10.2.)



Calculation. We note that $S_1(D) = [\tfrac12(1 + \sqrt{3}), \sqrt{3}]$ and $S_2(D) = [\tfrac12(3 + \sqrt{3}),
1 + \sqrt{3}]$ so we can use Propositions 9.6 and 9.7 to estimate $\dim_H F$. By
the mean value theorem (see Section 1.2) if $x, y \in D$ are distinct points then
$(S_i(x) - S_i(y))/(x - y) = S'_i(z_i)$ for some $z_i \in D$. Thus for $i = 1, 2$,

$$\inf_{x \in D} |S'_i(x)| \le \frac{|S_i(x) - S_i(y)|}{|x - y|} \le \sup_{x \in D} |S'_i(x)|.$$

Since $S'_1(x) = S'_2(x) = -1/x^2$ it follows that

$$\tfrac12(2 - \sqrt{3}) = (1 + \sqrt{3})^{-2} \le \frac{|S_i(x) - S_i(y)|}{|x - y|} \le (\tfrac12(1 + \sqrt{3}))^{-2} = 2(2 - \sqrt{3})$$

for both $i = 1$ and $i = 2$. According to Propositions 9.6 and 9.7 lower and
upper bounds for the dimensions are given by the solutions of
$2(\tfrac12(2 - \sqrt{3}))^s = 1$ and $2(2(2 - \sqrt{3}))^s = 1$ which are $s = \log 2/\log(2(2 + \sqrt{3})) = 0.34$
and $\log 2/\log(\tfrac12(2 + \sqrt{3})) = 1.11$ respectively.

For a subset of the real line, an upper bound greater than 1 is not of much
interest. One way of getting better estimates is to note that $F$ is also the attractor
of the four mappings on $D$

$$S_i \circ S_j(x) = i + 1/(j + 1/x) = i + x/(jx + 1) \quad (i, j = 1, 2).$$

By calculating derivatives and using the mean-value theorem as before, we
get that

$$(S_i \circ S_j)'(x) = (jx + 1)^{-2}$$

so

$$(j(1 + \sqrt{3}) + 1)^{-2}|x - y| \le |S_i \circ S_j(x) - S_i \circ S_j(y)| \le (\tfrac12 j(1 + \sqrt{3}) + 1)^{-2}|x - y|.$$

Lower and upper bounds for the dimensions are now given by the solutions
of $2(2 + \sqrt{3})^{-2s} + 2(3 + 2\sqrt{3})^{-2s} = 1$ and $2(\tfrac12(3 + \sqrt{3}))^{-2s} + 2(2 + \sqrt{3})^{-2s} =
1$, giving $0.44 < \dim_H F < 0.66$, a considerable improvement on the previous
estimates. In fact $\dim_H F = 0.531$, a value that may be obtained by looking at
yet higher-order iterates of the $S_i$. □
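The four bounds quoted in this calculation can be reproduced numerically. The sketch below solves the equations from Propositions 9.6 and 9.7 by bisection; the helper `solve_decreasing` and the variable names are illustrative, and the derivative bounds are exactly those derived above.

```python
import math

def solve_decreasing(f, lo=0.0, hi=4.0, tol=1e-12):
    """Root of a strictly decreasing function f on [lo, hi] by bisection."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

R3 = math.sqrt(3.0)
X_MIN, X_MAX = 0.5 * (1 + R3), 1 + R3      # end points of D

# First level: |S_i'(x)| = x**-2 for both maps.
b1, c1 = X_MAX ** -2, X_MIN ** -2
lower1 = solve_decreasing(lambda s: 2 * b1 ** s - 1)
upper1 = solve_decreasing(lambda s: 2 * c1 ** s - 1)

# Second level: |(S_i o S_j)'(x)| = (j*x + 1)**-2, the same for i = 1, 2.
b2 = [(j * X_MAX + 1) ** -2 for j in (1, 2)]
c2 = [(j * X_MIN + 1) ** -2 for j in (1, 2)]
lower2 = solve_decreasing(lambda s: 2 * sum(b ** s for b in b2) - 1)
upper2 = solve_decreasing(lambda s: 2 * sum(c ** s for c in c2) - 1)
```

The first-level pair brackets the dimension only loosely, while the second-level pair lies inside the interval $(0.44, 0.66)$ stated in the example.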

*[The rest of this subsection may be omitted.] 

The technique used in Example 9.8 to improve the dimension estimates is
often useful for attractors of transformations that are not strict similarities. If
$F$ is the attractor for the IFS $\{S_1, \ldots, S_m\}$ on $D$ then $F$ is also the attractor
for the IFS consisting of the $m^k$ transformations $\{S_{i_1} \circ \cdots \circ S_{i_k}\}$ for each $k$. If
the $S_i$ are, say, twice differentiable on an open set containing $F$, it may be
shown that when $k$ is large, the contractions $S_{i_1} \circ \cdots \circ S_{i_k}$ are, in a sense, close
to similarities on $D$. In particular, for transformations on a subset $D$ of $\mathbb{R}$, if
$b = \inf_{x \in D} |(S_{i_1} \circ \cdots \circ S_{i_k})'(x)|$ and $c = \sup_{x \in D} |(S_{i_1} \circ \cdots \circ S_{i_k})'(x)|$, then

$$b|x - y| \le |S_{i_1} \circ \cdots \circ S_{i_k}(x) - S_{i_1} \circ \cdots \circ S_{i_k}(y)| \le c|x - y| \quad (x, y \in D).$$

If $k$ is large then $b/c$ will be close to 1, and applying Propositions 9.6 and 9.7
to the $m^k$ transformations $S_{i_1} \circ \cdots \circ S_{i_k}$ gives good upper and lower estimates for
the dimensions of $F$.

We may take this further. If the $S_i$ are twice differentiable on a subset $D$ of $\mathbb{R}$,

$$\frac{|S_{i_1} \circ \cdots \circ S_{i_k}(x) - S_{i_1} \circ \cdots \circ S_{i_k}(y)|}{|x - y|} \sim |(S_{i_1} \circ \cdots \circ S_{i_k})'(w)|$$

for large $k$, where $x$, $y$ and $w$ are any points of $D$. The composition of mappings
$S_{i_1} \circ \cdots \circ S_{i_k}$ is close to a similarity on $D$, so by comparison with Theorem 9.3
we would expect the dimension of the attractor $F$ to be close to the value of $s$
for which

$$\sum_{I_k} |(S_{i_1} \circ \cdots \circ S_{i_k})'(w)|^s = 1 \qquad (9.19)$$

where the sum is over the set $I_k$ of all $k$-term sequences. This expectation moti-
vates the following theorem.

Theorem 9.9 

Let $V \subset \mathbb{R}$ be an open interval. Let $S_1, \ldots, S_m$ be contractions on $V$ that are
twice differentiable on $V$ with $a \le |S'_i(w)| \le c$ for all $i$ and $w \in V$, where
$0 < a \le c < 1$ are constants. Suppose that the $S_i$ satisfy the open set condition (9.11)
with open set $V$. Then the limit

$$\lim_{k \to \infty} \Big[\sum_{I_k} |(S_{i_1} \circ \cdots \circ S_{i_k})'(w)|^s\Big]^{1/k} = \varphi(s) \qquad (9.20)$$

exists for each $s > 0$, is independent of $w \in V$, and is decreasing in $s$. If $F$ is
the attractor of $\{S_1, \ldots, S_m\}$ then $\dim_H F = \dim_B F$ is the solution of $\varphi(s) = 1$,
and $F$ is an $s$-set, i.e. $0 < \mathcal{H}^s(F) < \infty$ for this value of $s$.

Note on Proof. The main difficulty is to show that the limit (9.20) exists; this
depends on the differentiability condition on the $S_i$. Given this, the argument
outlined above may be used to show that the value of $s$ satisfying (9.19) is a
good approximation to the dimension when $k$ is large; letting $k \to \infty$ then gives
the result.

Similar ideas, but involving the rate of convergence to the limit in (9.20), are
needed to show that $0 < \mathcal{H}^s(F) < \infty$. □
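As an illustration of (9.19), the sketch below evaluates the level-$k$ sum for the two maps $S_i(x) = i + 1/x$ of Example 9.8 and solves for $s$ by bisection. The base point $w = 2$ (a point of $D$), the values of $k$ and the function names are arbitrary choices made for this sketch; the derivatives are accumulated with the chain rule.

```python
def level_derivatives(k, w):
    """|(S_{i1} o ... o S_{ik})'(w)| for all 2**k sequences, S_i(x) = i + 1/x."""
    # Each state is (current point, |derivative| accumulated so far); a new map
    # is applied on the outside, so the chain rule multiplies by |S_i'(y)| = y**-2.
    states = [(w, 1.0)]
    for _ in range(k):
        states = [(i + 1.0 / y, d / (y * y)) for y, d in states for i in (1, 2)]
    return [d for _, d in states]

def level_dimension(k, w=2.0, tol=1e-12):
    """Solve sum |(S_{i1} o ... o S_{ik})'(w)|**s = 1, i.e. (9.19), by bisection."""
    derivs = level_derivatives(k, w)
    lo, hi = 0.0, 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if sum(d ** mid for d in derivs) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Approximations to the dimension for increasing k.
estimates = {k: level_dimension(k) for k in (2, 4, 8, 12)}
```

For even $k$ every estimate must lie between the second-level bounds of Example 9.8, and as $k$ grows the values should settle towards the figure of 0.531 quoted there.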

There are higher-dimensional analogues of Theorem 9.9. Suppose that the con-
tractions $S_1, \ldots, S_m$ on a domain $D$ in the complex plane are complex analytic
mappings. Then the $S_i$ are conformal, or in other words are locally like similarity
transformations, contracting at the same rate in every direction. We have

$$S_i(z) = S_i(z_0) + S'_i(z_0)(z - z_0) + \text{terms in } (z - z_0)^2 \text{ and higher powers}$$

so that if $z - z_0$ is small

$$S_i(z) \simeq S_i(z_0) + S'_i(z_0)(z - z_0) \qquad (9.21)$$

where $S'_i(z_0)$ is a complex number with $|S'_i(z_0)| < 1$. But the right-hand side of
(9.21) is just a similarity expressed in complex notation. In this setting, Theo-
rem 9.9 holds, by the same sort of argument as in the 1-dimensional case.

Results such as these are part of the 'thermodynamic formalism', a body of 
theory that leads to dimension formulae for many attractors. 

9.4 Self-affine sets 

Self-affine sets form an important class of sets, which include self-similar sets as
a particular case. An affine transformation $S : \mathbb{R}^n \to \mathbb{R}^n$ is a transformation of
the form

$$S(x) = T(x) + b$$

where $T$ is a linear transformation on $\mathbb{R}^n$ (representable by an $n \times n$ matrix)
and $b$ is a vector in $\mathbb{R}^n$. Thus an affine transformation $S$ is a combination of
a translation, rotation, dilation and, perhaps, a reflection. In particular, $S$ maps
spheres to ellipsoids, squares to parallelograms, etc. Unlike similarities, affine
transformations contract with differing ratios in different directions.

If an IFS consists of affine contractions $\{S_1, \ldots, S_m\}$ on $\mathbb{R}^n$, the attractor $F$
guaranteed by Theorem 9.1 is termed a self-affine set. An example is given in
figure 9.8: $S_1$, $S_2$ and $S_3$ are defined as the transformations that map the square
$E$ onto the three rectangles in the obvious way. (In the figure the attractor $F$
is represented as the aggregate of $S_{i_1} \circ \cdots \circ S_{i_k}(E)$ over all sequences $(i_1, \ldots, i_k)$
with $i_j = 1, 2, 3$ for suitably large $k$. Clearly $F$ is made up of the three affine
copies of itself: $S_1(F)$, $S_2(F)$ and $S_3(F)$.)

It is natural to look for a formula for the dimension of self-affine sets that gen- 
eralizes formula (9.13) for self-similar sets. We would hope that the dimension 
depends on the affine transformations in a reasonably simple way, easily express- 
ible in terms of the matrices and vectors that represent the affine transformation. 
Unfortunately, the situation is much more complicated than this — the following 
example shows that if the affine transformations are varied in a continuous way, 
the dimension of the self-affine set need not change continuously. 

Example 9.10 

Let $S_1, S_2$ be the affine contractions on $\mathbb{R}^2$ that map the unit square onto the
rectangles $R_1$ and $R_2$ of sides $\tfrac12$ and $\varepsilon$ where $0 < \varepsilon < \tfrac12$, as in figure 9.9. The
rectangle $R_1$ abuts the $y$-axis, but the end of $R_2$ is distance $0 \le \lambda \le \tfrac12$ from the
$y$-axis. If $F$ is the attractor of $\{S_1, S_2\}$, we have $\dim_H F \ge 1$ when $\lambda > 0$, but
$\dim_H F = \log 2/(-\log \varepsilon) < 1$ when $\lambda = 0$.

Figure 9.8 A self-affine set which is the attractor of the affine transformations that map
the square $E$ onto the rectangles shown

Figure 9.9 Discontinuity of the dimension of self-affine sets. The affine mappings $S_1$ and
$S_2$ map the unit square $E$ onto $R_1$ and $R_2$. In (a) $\lambda > 0$ and $\dim_H F \ge \dim_H \mathrm{proj}\, F = 1$,
but in (b) $\lambda = 0$, and $\dim_H F = \log 2/(-\log \varepsilon) < 1$

Calculation. Suppose $\lambda > 0$ (figure 9.9(a)). Then the $k$th stage of the construction
$E_k = \bigcup S_{i_1} \circ \cdots \circ S_{i_k}(E)$ consists of $2^k$ rectangles of sides $2^{-k}$ and $\varepsilon^k$ with the
projection of $E_k$ onto the $x$-axis, $\mathrm{proj}\, E_k$, containing the interval $[0, 2\lambda]$. Since
$F = \bigcap_{k=1}^\infty E_k$ it follows that $\mathrm{proj}\, F$ contains the interval $[0, 2\lambda]$. (Another way
of seeing this is by noting that $\mathrm{proj}\, F$ is the attractor of $S_1, S_2 : \mathbb{R} \to \mathbb{R}$ given by
$S_1(x) = \tfrac12 x$, $S_2(x) = \tfrac12 x + \lambda$, which has as attractor the interval $[0, 2\lambda]$.) Thus
$\dim_H F \ge \dim_H \mathrm{proj}\, F = \dim_H [0, 2\lambda] = 1$.

If $\lambda = 0$, the situation changes (figure 9.9(b)). $E_k$ consists of $2^k$ rectangles of
sides $2^{-k}$ and $\varepsilon^k$ which all have their left-hand ends abutting the $y$-axis, with
$E_k$ contained in the strip $\{(x, y) : 0 \le x \le 2^{-k}\}$. Letting $k \to \infty$ we see that
$F$ is a uniform Cantor set contained in the $y$-axis, which may be obtained by
repeatedly removing a proportion $1 - 2\varepsilon$ from the centre of each interval. Thus
$\dim_H F = \log 2/(-\log \varepsilon) < 1$ (see Example 4.5). □

With such discontinuous behaviour, which becomes even worse for more 
involved sets of affine transformations, it is likely to be difficult to obtain a 
general expression for the dimension of self-affine sets. However, one situation 
which has been completely analysed is the self-affine set obtained by the follow- 
ing recursive construction; a specific case is illustrated in figures 9.10 and 9.11. 

Example 9.11 

Let the unit square $E_0$ be divided into a $p \times q$ array of rectangles of sides $1/p$
and $1/q$ where $p$ and $q$ are positive integers with $p < q$. Select a subcollection of
these rectangles to form $E_1$, and let $N_j$ denote the number of rectangles selected
from the $j$th column for $1 \le j \le p$; see figure 9.10. Iterate this construction in
the usual way, with each rectangle replaced by an affine copy of $E_1$, and let $F$
be the limiting set obtained. Then

$$\dim_H F = \log\Big(\sum_{j=1}^p N_j^{\log p/\log q}\Big) \frac{1}{\log p}$$

and

$$\dim_B F = \frac{\log p_1}{\log p} + \log\Big(\frac{1}{p_1} \sum_{j=1}^p N_j\Big) \frac{1}{\log q}$$

where $p_1$ is the number of columns containing at least one rectangle of $E_1$.

Calculation. Omitted. □ 

Notice in this example that the dimension depends not only on the number of
rectangles selected at each stage, but also on their relative positions. Moreover
$\dim_H F$ and $\dim_B F$ are not, in general, equal.
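The two formulas of Example 9.11 are easy to evaluate for a given choice of rectangles. In the sketch below the function name and the sample column counts are hypothetical; they are not the data of figure 9.11.

```python
import math

def carpet_dimensions(p, q, column_counts):
    """Hausdorff and box dimensions from the formulas of Example 9.11.

    column_counts[j] is N_j, the number of rectangles chosen in column j.
    """
    if not (1 < p < q) or len(column_counts) != p:
        raise ValueError("need 1 < p < q and one count per column")
    if any(not (0 <= n <= q) for n in column_counts) or sum(column_counts) == 0:
        raise ValueError("each N_j must lie in 0..q and some N_j must be positive")
    ratio = math.log(p) / math.log(q)
    nonempty = [n for n in column_counts if n > 0]   # empty columns contribute 0
    p1 = len(nonempty)
    dim_h = math.log(sum(n ** ratio for n in nonempty)) / math.log(p)
    dim_b = math.log(p1) / math.log(p) + math.log(sum(nonempty) / p1) / math.log(q)
    return dim_h, dim_b

# Hypothetical 2 x 3 array with one rectangle in the first column, two in the second.
uneven = carpet_dimensions(2, 3, [1, 2])
# Same array with two rectangles in every column.
even = carpet_dimensions(2, 3, [2, 2])
```

With unequal column counts the Hausdorff dimension comes out strictly below the box dimension, whereas equal counts make the two formulas agree.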

*[The rest of this subsection may be omitted.] 
Figure 9.10 Data for the self-affine set of Example 9.11. The affine transformations map
the square onto selected $1/p \times 1/q$ rectangles from the $p \times q$ array

The above example is rather specific in that the affine transformations are
all translates of each other. Obtaining a dimension formula for general self-
affine sets is an intractable problem. We briefly outline an approach which leads
to an expression for the dimension of the attractor of the affine contractions
$S_i(x) = T_i(x) + b_i$ $(1 \le i \le m)$ for almost all sequences of vectors $b_1, \ldots, b_m$.

Let $T : \mathbb{R}^n \to \mathbb{R}^n$ be a linear mapping that is contracting and non-singular.
The singular values $1 > \alpha_1 \ge \alpha_2 \ge \cdots \ge \alpha_n > 0$ of $T$ may be defined in two
ways: they are the lengths of the principal semi-axes of the ellipsoid $T(B)$ where
$B$ is the unit ball in $\mathbb{R}^n$, and they are the positive square roots of the eigenvalues
of $T^*T$, where $T^*$ is the adjoint of $T$. Thus the singular values reflect the con-
tractive effect of $T$ in different directions. For $0 \le s \le n$ we define the singular
value function

$$\varphi^s(T) = \alpha_1 \alpha_2 \cdots \alpha_{r-1} \alpha_r^{s-r+1} \qquad (9.22)$$

where $r$ is the integer for which $r - 1 < s \le r$. Then $\varphi^s(T)$ is continuous and
strictly decreasing in $s$. Moreover, for fixed $s$, $\varphi^s$ may be shown to be submul-
tiplicative, i.e.

$$\varphi^s(TU) \le \varphi^s(T)\varphi^s(U)$$

for any linear mappings $T$ and $U$. We introduce the $k$th level sums $\Sigma_k^s =
\sum_{I_k} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_k})$ where $I_k$ denotes the set of all $k$-term sequences $(i_1, \ldots, i_k)$
with $1 \le i_j \le m$. For fixed $s$

$$\Sigma_{k+q}^s = \sum_{I_{k+q}} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_{k+q}}) \le \sum_{I_{k+q}} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_k}) \varphi^s(T_{i_{k+1}} \circ \cdots \circ T_{i_{k+q}})$$

$$= \Big(\sum_{I_k} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_k})\Big) \Big(\sum_{I_q} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_q})\Big) = \Sigma_k^s \Sigma_q^s$$

i.e. the sequence $\Sigma_k^s$ is submultiplicative in $k$. By a standard property of submul-
tiplicative sequences, $(\Sigma_k^s)^{1/k}$ converges to a number $\Sigma_\infty^s$ as $k \to \infty$. Since $\varphi^s$ is
decreasing in $s$, so is $\Sigma_\infty^s$. Provided that $\Sigma_\infty^n \le 1$, there is a unique $s$, which we
denote by $d(T_1, \ldots, T_m)$, such that $1 = \Sigma_\infty^s = \lim_{k \to \infty} (\sum_{I_k} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_k}))^{1/k}$.
Equivalently

$$d(T_1, \ldots, T_m) = \inf\Big\{s : \sum_{k=1}^\infty \sum_{I_k} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_k}) < \infty\Big\}. \qquad (9.23)$$

Figure 9.11 Construction of a self-affine set of the type considered in Example 9.11.
Such sets may have different Hausdorff and box dimensions
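For $2 \times 2$ matrices the singular value function can be computed directly from the eigenvalues of $T^*T$, and $d(T_1, \ldots, T_m)$ can be approximated by solving $\Sigma_k^s = 1$ at a fixed level $k$. The sketch below makes that finite-level approximation explicit; the function names, the choice $k = 4$ and the sample matrix are assumptions of the sketch, and it presumes the level sum is at most 1 at $s = 2$.

```python
import math
from itertools import product

def singular_values_2x2(t):
    """Singular values a1 >= a2 of a 2x2 matrix ((a, b), (c, d))."""
    (a, b), (c, d) = t
    trace = a * a + b * b + c * c + d * d          # trace of T*T
    det = a * d - b * c                            # det(T*T) = det**2
    disc = math.sqrt(max(trace * trace - 4.0 * det * det, 0.0))
    return math.sqrt((trace + disc) / 2.0), math.sqrt(max(trace - disc, 0.0) / 2.0)

def phi(s, t):
    """Singular value function (9.22) for n = 2 and 0 <= s <= 2."""
    a1, a2 = singular_values_2x2(t)
    return a1 ** s if s <= 1.0 else a1 * a2 ** (s - 1.0)

def matmul(t, u):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(tuple(sum(t[i][k] * u[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def affinity_dimension(matrices, k=4, tol=1e-10):
    """Solve sum over I_k of phi^s(T_i1 ... T_ik) = 1 for s in [0, 2]."""
    prods = []
    for seq in product(matrices, repeat=k):
        t = seq[0]
        for u in seq[1:]:
            t = matmul(t, u)
        prods.append(t)
    lo, hi = 0.0, 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if sum(phi(mid, t) for t in prods) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Three maps sharing the linear part diag(1/2, 1/3) (a hypothetical example).
T = ((0.5, 0.0), (0.0, 1.0 / 3.0))
d_est = affinity_dimension([T, T, T])
```

When all the linear parts coincide and are diagonal, as here, the level-$k$ equation has the same root for every $k$, namely $1 + \log(3/2)/\log 3$; in general the root only approaches $d(T_1, \ldots, T_m)$ as $k$ increases.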



Theorem 9.12 

Let $T_1, \ldots, T_m$ be linear contractions and let $y_1, \ldots, y_m \in \mathbb{R}^n$ be vectors. If $F$ is
the self-affine set satisfying

$$F = \bigcup_{i=1}^m (T_i(F) + y_i)$$

then $\dim_H F = \dim_B F \le d(T_1, \ldots, T_m)$. If $|T_i(x) - T_i(y)| \le c|x - y|$ for all $i$
where $0 < c < \tfrac12$ then equality holds for almost all $(y_1, \ldots, y_m) \in \mathbb{R}^{nm}$ in the
sense of $nm$-dimensional Lebesgue measure.

Figure 9.12 Each of the fractals depicted above is the attractor of the set of transforma-
tions that map the square onto the three rectangles. The affine transformations for each
fractal differ only by translations, so by Theorem 9.12 the three fractals all have the same
dimension (unless we have been very unlucky in our positioning!). A computation gives
this common value of Hausdorff and box dimension as about 1.42



Partial proof. We show that $\dim_H F \le d(T_1, \ldots, T_m)$ for all $y_1, \ldots, y_m$. Write
$S_i$ for the contracting affine transformation $S_i(x) = T_i(x) + y_i$. Let $B$ be a
large ball so that $S_i(B) \subset B$ for all $i$. Given $\delta > 0$ we may choose $k$ large
enough to get $|S_{i_1} \circ \cdots \circ S_{i_k}(B)| < \delta$ for every $k$-term sequence $(i_1, \ldots, i_k) \in I_k$.
By (9.6) $F \subset \bigcup_{I_k} S_{i_1} \circ \cdots \circ S_{i_k}(B)$. But $S_{i_1} \circ \cdots \circ S_{i_k}(B)$ is a translate of the
ellipsoid $T_{i_1} \circ \cdots \circ T_{i_k}(B)$ which has principal axes of lengths $\alpha_1|B|, \ldots, \alpha_n|B|$,
where $\alpha_1, \ldots, \alpha_n$ are the singular values of $T_{i_1} \circ \cdots \circ T_{i_k}$. Thus $S_{i_1} \circ \cdots \circ S_{i_k}(B)$ is
contained in a rectangular parallelepiped $P$ of side lengths $\alpha_1|B|, \ldots, \alpha_n|B|$. If
$0 \le s \le n$ and $r$ is the least integer greater than or equal to $s$, we may divide $P$
into at most

$$\Big(\frac{2\alpha_1}{\alpha_r}\Big)\Big(\frac{2\alpha_2}{\alpha_r}\Big) \cdots \Big(\frac{2\alpha_{r-1}}{\alpha_r}\Big) \le 2^n \alpha_1 \cdots \alpha_{r-1} \alpha_r^{1-r}$$

cubes of side $\alpha_r|B| < \delta$. Hence $S_{i_1} \circ \cdots \circ S_{i_k}(B)$ may be covered by a collection
of cubes $U_i$ with $|U_i| < \delta\sqrt{n}$ such that

$$\sum_i |U_i|^s \le 2^n \alpha_1 \cdots \alpha_{r-1} \alpha_r^{1-r} \alpha_r^s |B|^s \le 2^n |B|^s \varphi^s(T_{i_1} \circ \cdots \circ T_{i_k}).$$

Taking such a cover of $S_{i_1} \circ \cdots \circ S_{i_k}(B)$ for each $(i_1, \ldots, i_k) \in I_k$ it follows that

$$\mathcal{H}^s_{\delta\sqrt{n}}(F) \le 2^n |B|^s \sum_{I_k} \varphi^s(T_{i_1} \circ \cdots \circ T_{i_k}).$$

But $k \to \infty$ as $\delta \to 0$, so by (9.23), $\mathcal{H}^s(F) = 0$ if $s > d(T_1, \ldots, T_m)$. Thus
$\dim_H F \le d(T_1, \ldots, T_m)$.

The lower estimate for $\dim_H F$ may be obtained using the potential theoretic
techniques of Section 4.3. We omit the somewhat involved details. □

One consequence of this theorem is that, unless we have been unfortunate 
enough to hit on an exceptional set of parameters, the fractals in figure 9.12 all 
have the same dimension, estimated at about 1.42. 

9.5 Applications to encoding images 

In this chapter, we have seen how a small number of contractions can deter- 
mine objects of a highly intricate fractal structure. This has applications to data 
compression — if a complicated picture can be encoded by a small amount of 
information, then the picture can be transmitted or stored very efficiently. 

It is desirable to know which objects can be represented as, or approximated 
by, attractors of an iterated function system, and also how to find contractions that 
lead to a good representation of a given object. Clearly, the possibilities using, 
say, three or four transformations are limited by the small number of parameters 
at our disposal. Such sets are also likely to have a highly repetitive structure. 

However, a little experimentation drawing self-affine sets on a computer (see 
end of Section 9.1) can produce surprisingly good pictures of naturally occurring 
objects such as ferns, grasses, trees, clouds, etc. The fern and tree in figure 9.13 
are the attractors of just four and six affine transformations, respectively. Self- 
similarity and self-affinity are indeed present in nature. 

The following theorem, sometimes known as the collage theorem, gives an 
idea of how good an approximation a set is to the attractor of an IFS. 

Theorem 9.13 

Let $\{S_1, \ldots, S_m\}$ be an IFS and suppose that $|S_i(x) - S_i(y)| \le c|x - y|$ for
all $x, y \in \mathbb{R}^n$ and all $i$, where $c < 1$. Let $E \subset \mathbb{R}^n$ be any non-empty compact
set. Then

$$d(E, F) \le d\Big(E, \bigcup_{i=1}^m S_i(E)\Big) \frac{1}{(1 - c)} \qquad (9.24)$$

where $F$ is the attractor for the IFS, and $d$ is the Hausdorff metric.

Figure 9.13 The fern (a) and tree (b) are the attractors of just four and six affine trans-
formations, respectively



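Proof. Using the triangle inequality for the Hausdorff metric followed by the
definition (9.2) of the attractor

$$d(E, F) \le d\Big(E, \bigcup_{i=1}^m S_i(E)\Big) + d\Big(\bigcup_{i=1}^m S_i(E), F\Big)$$

$$= d\Big(E, \bigcup_{i=1}^m S_i(E)\Big) + d\Big(\bigcup_{i=1}^m S_i(E), \bigcup_{i=1}^m S_i(F)\Big)$$

$$\le d\Big(E, \bigcup_{i=1}^m S_i(E)\Big) + c\,d(E, F)$$

by (9.5), as required. □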
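Inequality (9.24) can be checked numerically on a simple case. The sketch below works on the real line with the two similarities that generate the middle third Cantor set, takes $E = \{0, 1\}$, and replaces the attractor by the finite set of level-8 interval end points (which all lie in $F$, within $3^{-8}$ of every point of $F$). The helper names and these choices are illustrative assumptions.

```python
def hausdorff_distance(a, b):
    """Hausdorff distance between two finite non-empty sets of real numbers."""
    one_sided = lambda p, q: max(min(abs(x - y) for y in q) for x in p)
    return max(one_sided(a, b), one_sided(b, a))

def apply_ifs(maps, points):
    """The union of S_i(points) over all maps S_i."""
    return sorted({s(x) for s in maps for x in points})

# Middle third Cantor set: both maps have contraction ratio c = 1/3.
maps = [lambda x: x / 3.0, lambda x: x / 3.0 + 2.0 / 3.0]
c = 1.0 / 3.0

E = [0.0, 1.0]                       # the set whose closeness to F is tested
F_approx = E
for _ in range(8):                   # interval end points at level 8, all in F
    F_approx = apply_ifs(maps, F_approx)

collage_error = hausdorff_distance(E, apply_ifs(maps, E))
bound = collage_error / (1.0 - c)    # right-hand side of (9.24)
distance_to_attractor = hausdorff_distance(E, F_approx)
```

Here the collage error is $\tfrac13$, so the theorem guarantees $d(E, F) \le \tfrac12$, and the computed distance respects that bound.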



A consequence of Theorem 9.13 is that any compact subset of $\mathbb{R}^n$ can be
approximated arbitrarily closely by a self-similar set.

Corollary 9.14

Let $E$ be a non-empty compact subset of $\mathbb{R}^n$. Given $\delta > 0$ there exist contracting
similarities $S_1, \ldots, S_m$ with attractor $F$ satisfying $d(E, F) < \delta$.

Proof. Let $B_1, \ldots, B_m$ be a collection of balls that cover $E$ and which have
centres in $E$ and radii at most $\tfrac14\delta$. Then $E \subset \bigcup_{i=1}^m B_i \subset E_{\delta/4}$, where $E_{\delta/4}$ is
the $\tfrac14\delta$-neighbourhood of $E$. For each $i$, let $S_i$ be any contracting similarity
of ratio less than $\tfrac12$ that maps $E$ into $B_i$. Then $S_i(E) \subset B_i \subset (S_i(E))_{\delta/2}$, so
$(\bigcup_{i=1}^m S_i(E)) \subset E_{\delta/4}$ and $E \subset \bigcup_{i=1}^m (S_i(E))_{\delta/2}$. By definition of the Hausdorff
metric, $d(E, \bigcup_{i=1}^m S_i(E)) \le \tfrac12\delta$. It follows from (9.24) that $d(E, F) < \delta$ where
$F$ is the attractor. □

The approximation by the IFS attractor given by the above proof is rather 
coarse — it is likely to yield a very large number of contractions that take little 
account of the fine structure of E. A rather more subtle approach is required to 
obtain convincing images with a small number of transformations. One method 
which often gives good results is to draw a rough outline of the object and 
then cover it, as closely as possible, by a number of smaller similar (or affine) 
copies. The similarities (or affinities) thus determined may be used to compute an 
attractor which may be compared with the object being modelled. Theorem 9.13 
guarantees that the attractor will be a good approximation if the union of the 
smaller copies is close to the object. A trial and error process allows modification 
and improvements to the picture. 

More complex objects may be built up by superposition of the invariant sets 
of several different sets of transformations. 

Ideally, it would be desirable to have a 'camera' which could be pointed at an 
object to produce a 'photograph' consisting of a specified number of affine trans- 
formations whose attractor is a good approximation to the object. Obviously, the 
technical problems involved are considerable. One approach is to scan the object 
to estimate various geometric parameters, and use these to impose restrictions on 
the transformations. 

For example, for a 'natural fractal' such as a fern, we might estimate the dimen- 
sion by a box-counting method. The assumption that the similarities or affinities 
sought must provide an attractor of this dimension gives, at least theoretical, 
restrictions on the possible set of contractions, using results such as Theorem 9.3 
or 9.12. In practice, however, such information is rather hard to utilize, and we 
certainly need many further parameters for it to be of much use. 
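A box-counting estimate of the kind mentioned here can be sketched as follows. The two-scale slope, the function names and the test data are assumptions made for illustration; in practice one would fit a line through counts at several mesh sizes and the result depends on the range of scales chosen.

```python
import math

def box_count(points, delta):
    """Number of boxes of the delta-mesh that contain at least one point."""
    return len({(math.floor(x / delta), math.floor(y / delta)) for x, y in points})

def box_dimension_estimate(points, delta_coarse, delta_fine):
    """Slope of log N(delta) against -log(delta) between two mesh sizes."""
    n_coarse = box_count(points, delta_coarse)
    n_fine = box_count(points, delta_fine)
    return math.log(n_fine / n_coarse) / math.log(delta_coarse / delta_fine)

# Sanity check on a set of known dimension 1: sample points on a line segment.
segment = [(k / 10000.0, 0.0) for k in range(10000)]
estimate = box_dimension_estimate(segment, 1.0 / 8, 1.0 / 64)
```

The same functions may be applied to points digitized from an image, provided the finer mesh is still coarse compared with the spacing of the sample points.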

Very often, attractors in the plane that provide good pictures of physical objects 
will have positive area, so will not be fractals in the usual sense. Nevertheless, 
such sets may well be bounded by fractal curves, a feature that adds realism to 
pictures of natural objects. However, fractal properties of boundaries of invariant 
sets seem hard to analyse. 

These ideas may be extended to provide shaded or coloured images, by
assigning a probability $p_i$ to each of the contractions $S_i$, where $0 < p_i < 1$ and
$\sum_{i=1}^m p_i = 1$. Without going into details, these data define a mass distribution
$\mu$ on the attractor $F$ such that $\mu(A) = \sum_{i=1}^m p_i \mu(S_i^{-1}(A))$, and the set may be
shaded, or even coloured, according to the local density of $\mu$.

This leads to the following modification of the second method of drawing
attractors mentioned at the end of Section 9.1. Let $x_0$ be any initial point. We
choose $S_{j_1}$ from $S_1, \ldots, S_m$ at random in such a way that the probability of
choosing $S_i$ is $p_i$, and let $x_1 = S_{j_1}(x_0)$. We continue in this way, so that $x_k =
S_{j_k}(x_{k-1})$ where $S_{j_k}$ equals $S_i$ with probability $p_i$. Plotting the sequence $\{x_k\}$
(after omitting the first 100 terms, say) gives a rendering of the attractor $F$, but
in such a way that a proportion $p_{i_1} \cdots p_{i_q}$ of the points tends to lie in the part
$S_{i_1} \circ \cdots \circ S_{i_q}(F)$ for each $i_1, \ldots, i_q$. This variable point density provides a direct
shading of $F$. Alternatively, the colour of a point of $F$ can be determined by
some rule, which depends on the number of $\{x_k\}$ falling close to each point. The
computer artist may care to experiment with the endless possibilities that this
method provides; certainly, some very impressive colour pictures have been
produced using relatively few transformations.
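The random iteration just described might be sketched as follows, here for the three similarities of the Sierpiński triangle with unequal probabilities. The probabilities, the seed, the starting point and the function names are arbitrary choices made for illustration; plotting is left to whatever graphics library is at hand.

```python
import math
import random

def random_iteration(maps, probabilities, n_points, skip=100, seed=0):
    """Orbit x_k = S_{j_k}(x_{k-1}) with S_i chosen with probability p_i."""
    rng = random.Random(seed)
    x = (0.0, 0.0)
    orbit = []
    for k in range(skip + n_points):
        s = rng.choices(maps, weights=probabilities)[0]
        x = s(x)
        if k >= skip:                 # discard the first `skip` terms
            orbit.append(x)
    return orbit

def halving_map(vertex):
    """Similarity of ratio 1/2 that fixes `vertex`."""
    vx, vy = vertex
    return lambda p: ((p[0] + vx) / 2.0, (p[1] + vy) / 2.0)

H = math.sqrt(3.0) / 2.0
maps = [halving_map(v) for v in [(0.0, 0.0), (1.0, 0.0), (0.5, H)]]
orbit = random_iteration(maps, [0.5, 0.25, 0.25], n_points=20000)

# Fraction of plotted points in S_1(F), the bottom-left copy of the triangle.
in_first_copy = sum(1 for x, y in orbit if x < 0.5 and y < H / 2.0) / len(orbit)
```

With these probabilities about half of the plotted points should fall in the bottom-left copy, so that part of the picture appears more densely shaded.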

It is perhaps appropriate to end this section with some of the 'pros and cons' 
of representing images using iterated function systems. By utilizing the self- 
similarity and repetition in nature, and, indeed, in man-made objects, the method 
often enables scenes to be described by a small number (perhaps fewer than 
100) of contractions and probabilities in an effective manner. This represents an 
enormous compression of information compared, for example, with that required 
to detail the colour in each square of a fine mesh. The corresponding disadvan- 
tage is that there is a high correlation between different parts of the picture — the 
method is excellent for giving an overall picture of a tree, but is no use if the 
exact arrangement of the leaves on different branches is important. Given a set of 
affine contractions, reproduction of the image is computationally straightforward, 
is well-suited to parallel computation, and is stable — small changes in the con- 
tractions lead to small changes in the attractor. The contractions define the image 
at arbitrarily small scales, and it is easy to produce a close-up of a small region. 
At present, the main disadvantage of the method is the difficulty of obtaining a 
set of contractions to represent a given object or picture. 



9.6 Notes and references 

The first systematic account of what are now known as iterated function sys-
tems is that of Hutchinson (1981), though similar ideas were around earlier. The
derivation of the formula for the dimension of self-similar sets was essentially
given by Moran (1946). Computer pictures of self-similar sets and attractors of
other IFSs are widespread; the works of Mandelbrot (1982), Dekking (1982),
Peitgen, Jürgens and Saupe (1992) and Barnsley (1993) contain many interesting
and beautiful examples.

For details of the thermodynamic formalism and material relating to Theo-
rem 9.9, see Ruelle (1983), Bedford (1991), Beck and Schlögl (1993), Falconer
(1997) and Pesin (1997).

A discussion of self-affine sets is given by Mandelbrot (1986) and a survey
on their dimension properties by Peres and Solomyak (2000). Full details of
Example 9.11 are given by McMullen (1984) and of Theorem 9.12 by Falconer
(1988) and Solomyak (1998).

These ideas have been extended in many directions: for example, for IFSs with
infinitely many transformations see Mauldin and Urbański (1996, 1999), and for
graph directed constructions see Mauldin and Williams (1988).

Applications to image compression and encoding are described by Barnsley
(1993), Barnsley and Hurd (1993) and Fisher (1995).
Exercises 

9.1 Verify that the Hausdorff metric satisfies the conditions for a metric.

9.2 Find a pair of similarity transformations on R for which the interval [0, 1] is the 
attractor. Now find infinitely many such pairs of transformations. 

9.3 Find sets of (i) four and (ii) three similarity transformations on R. for which the 
middle third Cantor set is the attractor. Check that (9.13) has solution log 2/ log 3 
in each case. 

9.4 Write down (using matrix notation) the four basic similarity transformations that 
define the von Koch curve (figure 0.2). Find an open set for which the open set 
condition holds and deduce from Theorem 9.3 that the von Koch curve does indeed 
have box and Hausdorff dimension of log 4/ log 3. 

9.5 Find an IFS for the set depicted in figure 0.5 and deduce that it has Hausdorff and 
box dimensions given by 4(|) s + (i) s = 1. 

9.6 Sketch the first few steps in the construction of a self-similar set with generator 
• l •. What are the Hausdorff and box dimensions of this fractal? (The stem of 
the T is one quarter of the total length of the top.) 

9.7 Let $F$ be the set obtained by a Cantor-type construction in which each interval is
replaced by two intervals, one of a quarter of the length at the left-hand end, and one
of half the length at the right-hand end. Thus $E_0$ is the interval $[0, 1]$, $E_1$ consists
of the intervals $[0, \tfrac14]$ and $[\tfrac12, 1]$, etc. Find an IFS with attractor $F$, and thus find the
Hausdorff and box dimensions of $F$.

9.8 Describe the attractors of the following IFSs on K. 



(i) Si 00 = \x, 


S 2 (x) = \x + \\ 


(ii) Sx(x) = \x, 


S 2 (x) = \x + i; 


(iii) Si(x) = \x, 


S 2 (x) = \x + i. 



9.9 Divide the unit square $E_0$ into $p^2$ squares of side $1/p$ in the obvious way and choose
some $m$ of these squares to form $E_1$. Let $S_i$ $(1 \le i \le m)$ be similarity transformations
that map $E_0$ onto each of these squares. Show that the attractor $F$ of the IFS so
defined has $\dim_H F = \dim_B F = \log m/\log p$.

9.10 Let $S_1, S_2 : [0, 1] \to [0, 1]$ be given by $S_1(x) = x/(2 + x)$ and $S_2(x) = 2/(2 + x)$.
Show that the attractor $F$ of this IFS satisfies $0.52 < \dim_H F < 0.81$.

9.11 Show that any self-similar set $F$ satisfying the conditions of Theorem 9.3 has $c_1 \le
\underline{D}^s(F, x) \le \overline{D}^s(F, x) \le c_2$ for all $x \in F$, where $c_1$ and $c_2$ are positive constants. (See
equations (5.2) and (5.3) for the definition of the densities.)

9.12 Let $S_1, \ldots, S_m$ be bi-Lipschitz contractions on a subset $D$ of $\mathbb{R}^n$ and let $F$ be the
attractor satisfying (9.2). Show that, if $V$ is any open set intersecting $F$, then $F$
and $F \cap V$ have equal Hausdorff, equal upper box and equal lower box dimensions.
Deduce from Corollary 3.9 that $\dim_P F = \overline{\dim}_B F$.

9.13 Verify the Hausdorff dimension formula in Example 9.11 in the cases (a) where
$N_j = N$ for $1 \le j \le p$ and (b) where $N_j = N$ or 0 for $1 \le j \le p$, where $N$ is an
integer with $1 < N < q$. (Hint: see Example 7.13.)

9.14 Find the Hausdorff and box dimensions of the set in figure 9.11. 

9.15 Write a computer program to draw self-similar sets in the plane, given a generator 
of the set. 

9.16 Write a computer program to draw the attractor of a given collection of contractions 
of a plane region (see the end of Section 9.1). Examine the attractors of similarities, 
affinities and try some non-linear transformations. If you are feeling really enter- 
prising, you might write a program to estimate the dimension of these sets using a 
box-counting method. 



Chapter 10 Examples from number theory



Fractals can often be defined in number theoretic terms; for instance, the middle
third Cantor set consists of the numbers between 0 and 1 which have a base-3
expansion containing only the digits 0 and 2. We examine three classes of fractals
that occur in classical number theory; these examples serve to illustrate some
of the ideas encountered earlier in the book.



10.1 Distribution of digits of numbers 

In this section we consider base-$m$ expansions of real numbers, where $m \geq 2$
is a fixed integer. Let $p_0, p_1, \ldots, p_{m-1}$ be 'proportions' summing to 1, so that
$0 \leq p_i \leq 1$ and $\sum_{i=0}^{m-1} p_i = 1$. Let $F(p_0, \ldots, p_{m-1})$ be the set of numbers $x$ in
$[0, 1)$ with base-$m$ expansions containing the digits $0, 1, \ldots, m-1$ in proportions
$p_0, \ldots, p_{m-1}$ respectively. More precisely, if $n_j(x|_k)$ denotes the number of times
the digit $j$ occurs in the first $k$ places of the base-$m$ expansion of $x$, then

\[ F(p_0, \ldots, p_{m-1}) = \{x \in [0, 1) : \lim_{k \to \infty} n_j(x|_k)/k = p_j \text{ for all } j = 0, \ldots, m-1\}. \tag{10.1} \]

Thus we think of $F(\tfrac13, \tfrac23)$ as the set of numbers with 'two thirds' of their base-2
digits being 1 and the rest being 0.

It is well-known that almost all numbers (in the sense of Lebesgue measure) are
normal to all bases; that is, they have base-$m$ expansions containing equal
proportions of the digits $0, 1, \ldots, m-1$ for all $m$. In our notation, $F(m^{-1}, \ldots, m^{-1})$
has Lebesgue measure 1, and therefore dimension 1, for all $m$. Paradoxically, no
specific example of a number that is normal to all bases has ever been exhibited.
We may use Hausdorff dimension to describe the size of the sets $F(p_0, \ldots, p_{m-1})$
when the $p_j$ are not all equal. (Such sets are dense in $[0, 1)$ so have box
dimension 1.)

A mass distribution technique is used in the following proof; the mass
distribution occurs naturally as a probability measure. Note that we adopt the usual
convention that $0 \log 0 = 0$.

Proposition 10.1

With $F = F(p_0, \ldots, p_{m-1})$ as above,

\[ \dim_H F = -\frac{1}{\log m} \sum_{i=0}^{m-1} p_i \log p_i. \]

Proof. The proof is best thought of probabilistically. We imagine that base-$m$
numbers $x = 0.i_1 i_2 \cdots$ are selected at random in such a way that the $k$th digit $i_k$
takes the value $j$ with probability $p_j$, independently for each $k$. Thus we take
$[0, 1)$ as our sample space and define a probability measure $P$ on subsets of $[0, 1)$
such that if $I_{i_1, \ldots, i_k}$ is the $k$th level basic interval containing the numbers with
base-$m$ expansion beginning $0.i_1 \cdots i_k$ then the probability of a number being in
this interval is

\[ P(I_{i_1, \ldots, i_k}) = p_{i_1} \cdots p_{i_k}. \tag{10.2} \]

Given $j$, the events 'the $k$th digit of $x$ is a $j$' are independent for $k = 1, 2, \ldots$.
A consequence of the strong law of large numbers (see Exercise 1.27) is that,
with probability 1, the proportion of occurrences of an event in a number of
repeated independent trials tends to the probability of the event occurring. Thus,
with probability 1,

\[ \frac{1}{k} n_j(x|_k) = \frac{1}{k}(\text{number of occurrences of } j \text{ in the first } k \text{ digits}) \to p_j \]

as $k \to \infty$ for all $j$. Hence $P(F) = 1$. We write $I_k(x) = I_{i_1, \ldots, i_k}$ for the $k$th level
interval (of length $m^{-k}$) to which $x$ belongs. For a fixed $y$, the probability that
$x \in I_k(y)$ is given by

\[ \log P(I_k(y)) = n_0(y|_k) \log p_0 + \cdots + n_{m-1}(y|_k) \log p_{m-1} \]

by taking logarithms of (10.2). If $y \in F$ then $n_j(y|_k)/k \to p_j$ as $k \to \infty$ for
each $j$, so

\[ \frac{1}{k} \log \frac{P(I_k(y))}{|I_k(y)|^s} = \frac{1}{k} \log P(I_k(y)) - \frac{1}{k} \log m^{-ks} \to \sum_{i=0}^{m-1} p_i \log p_i + s \log m. \]

Hence, for all $y$ in $F$, the 'interval density'

\[ \lim_{k \to \infty} \frac{P(I_k(y))}{|I_k(y)|^s} = \begin{cases} 0 & \text{if } s < \theta \\ \infty & \text{if } s > \theta \end{cases} \]

where

\[ \theta = -\frac{1}{\log m} \sum_{i=0}^{m-1} p_i \log p_i. \]

We are now virtually in the situation of Proposition 4.9. The same results hold and
may be proved in the same way, if the 'spherical densities' $\lim_{r \to 0} \mu(B(x, r))/r^s$
are replaced by these interval densities. Thus $\mathcal{H}^s(F) = \infty$ if $s < \theta$ and $\mathcal{H}^s(F) = 0$
if $s > \theta$, as required. □
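
The formula of Proposition 10.1 is easy to evaluate numerically. The following is a minimal Python sketch, not part of the original text; the function name `digit_set_dimension` is a hypothetical choice, and the base $m$ is taken to be the number of proportions supplied.

```python
import math

def digit_set_dimension(proportions):
    """Hausdorff dimension of F(p_0, ..., p_{m-1}) as given by Proposition 10.1.

    The base m is assumed to be len(proportions).
    """
    m = len(proportions)
    if m < 2:
        raise ValueError("need a base m >= 2")
    if any(p < 0 for p in proportions) or abs(sum(proportions) - 1.0) > 1e-12:
        raise ValueError("proportions must be non-negative and sum to 1")
    # Convention 0 log 0 = 0: zero proportions contribute nothing to the sum.
    entropy = -sum(p * math.log(p) for p in proportions if p > 0)
    return entropy / math.log(m)

# F(1/3, 2/3): base-2 digits with one third 0s and two thirds 1s.
print(digit_set_dimension([1 / 3, 2 / 3]))
# Equal proportions (normal numbers in base 4) give dimension 1.
print(digit_set_dimension([0.25] * 4))
```

For $F(\tfrac13, \tfrac23)$ the value is about 0.918, so this set, although of Lebesgue measure 0, has dimension close to 1.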



10.2 Continued fractions 

Instead of defining sets of numbers in terms of base-$m$ expansions, we may
use continued fraction expansions. Any number $x$ that is not an integer may be
written as

\[ x = a_0 + 1/x_1 \]

where $a_0$ is an integer and $x_1 > 1$. Similarly, if $x_1$ is not an integer, then

\[ x_1 = a_1 + 1/x_2 \]

with $x_2 > 1$, so

\[ x = a_0 + 1/(a_1 + 1/x_2). \]

Proceeding in this way,

\[ x = a_0 + 1/(a_1 + 1/(a_2 + 1/(\cdots + 1/(a_{k-1} + 1/x_k)))) \]

for each $k$, provided that at no stage is $x_k$ an integer. We call the sequence of
integers $a_0, a_1, a_2, \ldots$ the partial quotients of $x$, and write

\[ x = a_0 + \frac{1}{a_1 +}\, \frac{1}{a_2 +}\, \frac{1}{a_3 +} \cdots \]

for the continued fraction expansion of $x$. This expansion terminates if and only
if $x$ is rational; otherwise taking a finite number of terms,

\[ a_0 + 1/(a_1 + 1/(a_2 + 1/(\cdots + 1/a_k))) \]

provides a sequence of rational approximations to $x$ which converge to $x$ as $k \to \infty$.
(Continued fractions are, in fact, closely allied to the theory of Diophantine
approximation; see Section 10.3.)
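
The construction above can be sketched in a few lines of Python. This is an illustrative sketch only: the function names `partial_quotients` and `convergents` are hypothetical, and exact rational arithmetic is used so that the expansion of a rational number terminates as described.

```python
from fractions import Fraction

def partial_quotients(x, max_terms=50):
    """Partial quotients a_0, a_1, ... of a rational x, using exact arithmetic."""
    x = Fraction(x)
    quotients = []
    for _ in range(max_terms):
        a = x.numerator // x.denominator  # integer part a_k
        quotients.append(a)
        x -= a
        if x == 0:                        # expansion terminates: x is rational
            break
        x = 1 / x                         # x_{k+1} = 1/(x_k - a_k) > 1
    return quotients

def convergents(quotients):
    """The rationals a_0 + 1/(a_1 + 1/(... + 1/a_k)) for k = 0, 1, 2, ..."""
    result = []
    for k in range(len(quotients)):
        value = Fraction(quotients[k])
        for a in reversed(quotients[:k]):  # build the fraction from the inside out
            value = a + 1 / value
        result.append(value)
    return result

print(partial_quotients(Fraction(415, 93)))   # [4, 2, 6, 7]
print(convergents([1, 2, 2, 2]))              # 1, 3/2, 7/5, 17/12
```

The second call uses the first four partial quotients 1, 2, 2, 2 and returns the rational approximations 1, 3/2, 7/5 and 17/12.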

Examples of continued fractions include

\[ \sqrt{2} = 1 + \frac{1}{2 +}\, \frac{1}{2 +}\, \frac{1}{2 +} \cdots \]

\[ \sqrt{3} = 1 + \frac{1}{1 +}\, \frac{1}{2 +}\, \frac{1}{1 +}\, \frac{1}{2 +} \cdots \]

More generally, any quadratic surd (i.e. root of a quadratic equation with integer
coefficients) has eventually periodic partial quotients.

Sets of numbers defined by conditions on their partial quotients may be thought
of as fractal attractors of certain iterated function systems, as the following
example illustrates.

Example 10.2

Let $F$ be the set of positive real numbers $x$ with non-terminating continued fraction
expressions which have all partial quotients equal to 1 or 2. Then $F$ is a fractal
with $0.44 < \dim_H F < 0.66$.

Proof. It is easy to see that $F$ is closed (since its complement is open) and
bounded (since $F \subset [1, 3]$). Moreover, $x \in F$ precisely when $x = 1 + 1/y$ or
$x = 2 + 1/y$ with $y \in F$. Letting $S_1(x) = 1 + 1/x$ and $S_2(x) = 2 + 1/x$, it
follows that $F = S_1(F) \cup S_2(F)$; in other words, $F$ is the attractor of the iterated
function system $\{S_1, S_2\}$ in the sense of (9.2). In fact $F$ is just the set analysed
in Example 9.8 which we showed to have Hausdorff dimension between 0.44
and 0.66. □

Computational methods have been developed that enable dimensions of sets 
defined in terms of continued fraction expansions to be found very accurately. 
The set F of Example 10.2 has Hausdorff dimension 0.531280506 .... 

Obviously, varying the conditions on the partial quotients will lead to other 
fractals that are the attractors of certain transformations. 



10.3 Diophantine approximation 

How closely can a given irrational number $x$ be approximated by a rational $p/q$
with denominator $q$ no larger than $q_0$? Diophantine approximation is the study of
such problems, which can crop up in quite practical situations (see Section 13.6).
A classical theorem of Dirichlet (see Exercise 10.8) states that for every real
number $x$, there are infinitely many positive integers $q$ such that

\[ \left| x - \frac{p}{q} \right| \leq \frac{1}{q^2} \]

for some integer $p$; such $p/q$ are 'good' rational approximations to $x$.
Equivalently,

\[ \|qx\| \leq q^{-1} \]

for infinitely many $q$, where $\|y\| = \min_{m \in \mathbb{Z}} |y - m|$ denotes the distance from $y$
to the nearest integer.
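
Dirichlet's inequality can be explored numerically. The sketch below is illustrative only: the helper names are hypothetical, and the test $\|qx\| \leq q^{-1}$ is made in floating-point arithmetic, so it is reliable only for moderate $q$.

```python
import math

def dist_to_nearest_integer(y):
    """||y||: the distance from y to the nearest integer."""
    return abs(y - round(y))

def dirichlet_denominators(x, q_max):
    """All q in 1..q_max with ||q x|| <= 1/q (floating-point check)."""
    return [q for q in range(1, q_max + 1)
            if dist_to_nearest_integer(q * x) <= 1.0 / q]

good = dirichlet_denominators(math.sqrt(2), 100)
print(good)
```

For $x = \sqrt{2}$ the list contains the denominators 1, 2, 5, 12, 29 and 70 of the rational approximations obtained from the continued fraction of $\sqrt{2}$, in line with the close connection between the two theories.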

There are variations on Dirichlet's result that apply to almost all numbers $x$.
It may be shown that if $\psi(q)$ is a decreasing function of $q$ with $0 \leq \psi(q)$ then

\[ \|qx\| \leq \psi(q) \tag{10.3} \]

is satisfied by infinitely many $q$ for almost all $x$ or almost no $x$ (in the sense of
1-dimensional Lebesgue measure) according to whether $\sum_{q=1}^{\infty} \psi(q)$ diverges or
converges. In the latter case, the set of $x$ for which (10.3) does have infinitely
many solutions not only has Lebesgue measure 0, but is often a fractal.
We speak of a number $x$ such that

\[ \|qx\| \leq q^{1-\alpha} \tag{10.4} \]

for infinitely many positive integers $q$ as being $\alpha$-well approximable. It is natural
to ask how large this set of numbers is when $\alpha > 2$, and, indeed, whether such
irrational numbers exist at all. We prove Jarník's theorem, that the set of $\alpha$-well
approximable numbers has Hausdorff dimension $2/\alpha$.

It is almost immediate from Example 4.7 (check!) that the set of $\alpha$-well
approximable numbers has dimension at least $1/\alpha$. An extra subtlety is required to obtain
a value of $2/\alpha$. The idea is as follows. Let $G_q$ be the set of $x \in [0, 1]$ satisfying
(10.4). A factorization argument shows that, if $n$ is large and $p_1, p_2$ are primes
with $n < p_1, p_2 \leq 2n$, then $G_{p_1}$ and $G_{p_2}$ are disjoint (except for points very close
to 0 or 1). Thus the set

\[ H_n = \bigcup_{\substack{p \text{ prime} \\ n < p \leq 2n}} G_p \]

consists of, roughly, $\sum_{n < p \leq 2n} p \sim n^2/\log n$ reasonably regularly spaced
intervals of lengths at least $2(2n)^{-\alpha}$. We then show that if $n_k$ is a rapidly increasing
sequence, the intersection $\bigcap_{k=1}^{\infty} H_{n_k}$ has dimension at least $2/\alpha$, and note that
any number in this intersection lies in infinitely many $G_p$ and so is $\alpha$-well
approximable.



Jarník's theorem 10.3

Suppose $\alpha > 2$. Let $F$ be the set of real numbers $x \in [0, 1]$ for which the
inequality

\[ \|qx\| \leq q^{1-\alpha} \tag{10.5} \]

is satisfied by infinitely many positive integers $q$. Then $\dim_H F = 2/\alpha$.

*Proof. For each $q$ let $G_q$ denote the set of $x \in [0, 1]$ satisfying (10.5). Then $G_q$
consists of $q - 1$ intervals of length $2q^{-\alpha}$ and two 'end' intervals of length $q^{-\alpha}$.
Clearly, $F \subset \bigcup_{q=k}^{\infty} G_q$ for each $k$, so taking the intervals of $G_q$ for $q \geq k$ as
a cover of $F$ gives that $\mathcal{H}^s_\delta(F) \leq \sum_{q=k}^{\infty} (q+1)(2q^{-\alpha})^s$ if $2k^{-\alpha} \leq \delta$. If $s > 2/\alpha$
the series $\sum_{q=1}^{\infty} (q+1)(2q^{-\alpha})^s$ converges, so $\lim_{k \to \infty} \sum_{q=k}^{\infty} (q+1)(2q^{-\alpha})^s = 0$
and $\mathcal{H}^s(F) = 0$. Hence $\dim_H F \leq 2/\alpha$.
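
The convergence claim used in this step can be checked by a direct comparison; the following short computation is added as an illustration and uses only the quantities already defined. Since $q + 1 \leq 2q$ for $q \geq 1$,

\[ (q+1)(2q^{-\alpha})^s \leq 2q \cdot 2^s q^{-\alpha s} = 2^{s+1} q^{1 - \alpha s}, \]

and $\sum_{q=1}^{\infty} q^{1-\alpha s}$ converges precisely when $\alpha s - 1 > 1$, that is, when $s > 2/\alpha$.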

Let $G'_q$ be the set of $x \in (q^{-\alpha}, 1 - q^{-\alpha})$ satisfying (10.5), so that $G'_q$ is just $G_q$
with the end intervals removed. Let $n$ be a positive integer, and suppose $p_1$ and
$p_2$ are prime numbers satisfying $n < p_1 < p_2 \leq 2n$. We show that $G'_{p_1}$ and $G'_{p_2}$
are disjoint and reasonably well separated. For if $1 \leq r_1 < p_1$ and $1 \leq r_2 < p_2$,
then $p_1 r_2 \neq p_2 r_1$ since $p_1$ and $p_2$ are prime. Thus

\[ \left| \frac{r_1}{p_1} - \frac{r_2}{p_2} \right| = \frac{1}{p_1 p_2} |p_2 r_1 - p_1 r_2| \geq \frac{1}{p_1 p_2} \geq \frac{1}{4n^2}, \]

that is, the distance between the centres of any pairs of intervals from $G'_{p_1}$ and
$G'_{p_2}$ is at least $1/4n^2$. Since these intervals have lengths at most $2n^{-\alpha}$, the distance
between any point of $G'_{p_1}$ and any point of $G'_{p_2}$ is at least $\tfrac14 n^{-2} - 2n^{-\alpha} \geq \tfrac18 n^{-2}$
provided that $n \geq n_0$ for some sufficiently large $n_0$. For such $n$ the set



\[ H_n = \bigcup_{\substack{p \text{ prime} \\ n < p \leq 2n}} G'_p \]

is a disjoint union of the intervals in the $G'_p$, so $H_n$ is made up of intervals of
lengths at least $(2n)^{-\alpha}$ which are separated by gaps of lengths at least $\tfrac18 n^{-2}$.
If $I \subset [0, 1]$ is any interval with $3/|I| < n < p \leq 2n$ then at least $p|I|/3 \geq
n|I|/3$ of the intervals of $G'_p$ are completely contained in $I$. A version of the
prime number theorem states that the number of primes between 2 and $n$ is
asymptotically $n/\log n$, so there are at least $n/(2 \log n)$ primes in the range
$(n, 2n]$ if $n \geq n_1$, for some large $n_1 \geq n_0$. Thus at least

\[ \frac{n^2 |I|}{6 \log n} \tag{10.6} \]

intervals of $H_n$ are contained in $I$ provided that $n \geq n_1$ and $|I| \geq 3/n$.

To complete the proof, we use Example 4.6. With $n_1$ as above, let $n_k =
\max\{n_{k-1}^k, 3 \times 2^a n_{k-1}^a\}$ for $k = 2, 3, \ldots$, where $a > \alpha$ is an integer. Let $E_0 =
[0, 1]$ and for $k = 1, 2, \ldots$ let $E_k$ consist of those intervals of $H_{n_k}$ that are
completely contained in $E_{k-1}$. The intervals of $E_k$ are of lengths at least $(2n_k)^{-\alpha}$ and
are separated by gaps of at least $\varepsilon_k = \tfrac18 n_k^{-2}$. Using (10.6), each interval of $E_{k-1}$
contains at least $m_k$ intervals of $E_k$, where

\[ m_k = \frac{n_k^2 (2n_{k-1})^{-\alpha}}{6 \log n_k} = \frac{c\, n_k^2\, n_{k-1}^{-\alpha}}{\log n_k} \]

if $k \geq 2$, where $c = 2^{-\alpha}/6$. (We take $m_1 = 1$.) By (4.7)

\[ \dim_H \Big( \bigcap_{k=1}^{\infty} E_k \Big) \geq \liminf_{k \to \infty} \frac{\log[c^{k-2} n_1^{-\alpha} (n_2 \cdots n_{k-2})^{2-\alpha} n_{k-1}^2 (\log n_2)^{-1} \cdots (\log n_{k-1})^{-1}]}{-\log[c\, n_{k-1}^{-\alpha} (8 \log n_k)^{-1}]} \]

\[ = \lim_{k \to \infty} \frac{\log[c^{k-2} n_1^{-\alpha} (n_2 \cdots n_{k-2})^{2-\alpha} (\log n_2)^{-1} \cdots (\log n_{k-1})^{-1}] + 2 \log n_{k-1}}{-\log(c/8) + \log(\log n_k) + \alpha \log n_{k-1}} \]

\[ = 2/\alpha \]

since the dominant terms in numerator and denominator are those in $\log n_{k-1}$.
(Note that $\log n_k = k \log n_{k-1}$ so $\log n_k = c\,k!$ for $k$ sufficiently large.) If $x \in
E_k \subset H_{n_k}$ for all $k$, then $x$ lies in infinitely many of the $G'_p$ and so $x \in F$. Hence
$\dim_H F \geq 2/\alpha$. □

*[The rest of this section may be omitted.] 

Obviously, the set $F$ of Jarník's theorem is dense in $[0, 1]$, with $\dim_H(F \cap I) =
2/\alpha$ for any interval $I$. However, considerably more than this is true: $F$ is a 'set
of large intersection' of the type discussed in Section 8.2, and this has some
surprising consequences. For the definition of $\mathcal{C}^s$, in the following proposition,
see (8.7) and (8.8).

Proposition 10.4

Suppose $\alpha > 2$. If $F$ is the set of positive numbers such that $\|qx\| \leq q^{1-\alpha}$ for
infinitely many $q$, then $F \in \mathcal{C}^s[0, \infty)$ for all $s < 2/\alpha$.

Note on proof. This follows the proof of Jarník's Theorem 10.3 up to the
definition of $H_n$. Then a combination of the method of Example 8.9 and prime
number theorem estimates are used to show that $\lim_{n \to \infty} \mathcal{H}^s_\infty(I \cap H_n) = \mathcal{H}^s_\infty(I)$.
Slightly different methods are required to estimate the number of intervals of
$H_n$ that can intersect a covering interval $U$, depending on whether $|U| < 1/n$ or
$|U| \geq 1/n$. □

The first deduction from Proposition 10.4 is that $\dim_H F = 2/\alpha$, which we
know already from Jarník's Theorem. However, Proposition 8.8 tells us that
smooth bijective images of $F$ are also in $\mathcal{C}^s$. Thus if $s < 2/\alpha$ then $f(F \cap [a, b])$
is in $\mathcal{C}^s[f(a), f(b)]$ for any continuously differentiable function $f : [a, b] \to \mathbb{R}$
with $|f'(x)| > c > 0$. Taking the functions given by $f_m(x) = x^{1/m}$ we have that
$f_m(F) \cap [1, 2]$ is in $\mathcal{C}^s[1, 2]$ for $s < 2/\alpha$. It follows from Proposition 8.6 that
$\bigcap_{m=1}^{\infty} f_m(F) \cap [1, 2]$ is in $\mathcal{C}^s[1, 2]$, so

\[ \dim_H \Big( \bigcap_{m=1}^{\infty} f_m(F) \Big) = 2/\alpha. \]

But

\[ f_m(F) = \{x : \|q x^m\| \leq q^{1-\alpha} \text{ for infinitely many } q\} \]

so we have shown that the set of $x$ for which all positive integral powers are
$\alpha$-well-approximable has Hausdorff dimension $2/\alpha$.

Clearly, many variations are possible using different sequences of functions $f_m$.



10.4 Notes and references

There are many introductory books on number theory, of which the classic by 
Hardy and Wright (1979) is hard to beat. 

The dimensional analysis of the distribution of base-$m$ digits is given in
Billingsley (1978). Continued fractions are discussed in most basic texts in
number theory. Rogers (1998) and Bumby (1985) discuss dimensional aspects, and
Mauldin and Urbański (1999) consider them in the context of IFSs with infinitely
many mappings. Hensley (1996) has calculated the Hausdorff dimension of
various sets of numbers defined by continued fractions to many decimal places.

Full accounts of Diophantine approximation are to be found in the books of
Cassels (1957) and Schmidt (1980). Various proofs of Jarník's theorem have
been given by Jarník (1931), Besicovitch (1934), Eggleston (1952) and Kaufman
(1981). The book by Bernik and Dodson (1999) discusses many dimensional
aspects of Diophantine approximation.

Some novel relations between fractals and number theory may be found in 
Lapidus and van Frankenhuysen (2000). 

Exercises 

10.1 Find an IFS of similarities on $\mathbb{R}$ with attractor $F$ consisting of those numbers in
$[0, 1]$ whose decimal expansions contain only even digits. Show that $\dim_H F =
\log 5/\log 10$.

10.2 Show that the set $F(p_0, \ldots, p_{m-1})$ in (10.1) is the attractor of a set of $m$ similarity
transformations in the sense of (9.2). (It is not, of course, compact.)

10.3 Find the Hausdorff dimension of the set of numbers whose base-3 expansions
have 'twice as many 2s as 1s' (i.e. those $x$ such that $2 \lim_{k \to \infty} n_1(x|_k)/k =
\lim_{k \to \infty} n_2(x|_k)/k$ with both these limits existing).

10.4 Find the continued fraction representations of (i) $41/9$ and (ii) $\sqrt{5}$.

10.5 What number has continued fraction representation

\[ 1 + \frac{1}{1 +}\, \frac{1}{1 +}\, \frac{1}{1 +} \cdots ? \]

10.6 Use the continued fraction representation of $\sqrt{2}$ (with partial quotients 1, 2, 2,
2, . . .) to obtain some good rational approximations to $\sqrt{2}$. (In fact, the number
obtained by curtailing a continued fraction at the $k$th partial quotient gives the best
rational approximation by any number with equal or smaller denominator.)

10.7 Find an iterated function system whose attractor is the set of positive numbers with 
infinite continued fraction expansions that have partial quotients containing only 
the digits 2 and 3. Thus obtain estimates for the Hausdorff and box dimensions of 
this set. 

10.8 Let $x$ be a real number and $Q$ a positive integer. By considering the set of numbers
$\{rx \ (\mathrm{mod}\ 1) : r = 0, 1, \ldots, Q\}$, prove Dirichlet's theorem: i.e. that there is an integer
$q$ with $0 < q \leq Q$ such that $\|qx\| \leq Q^{-1}$. Deduce that there are infinitely many
positive integers $q$ such that $\|qx\| \leq q^{-1}$.

10.9 Let $n$ and $d$ be positive integers. Show that if the Diophantine equation $x^n - dy^n = 1$
has infinitely many solutions $(x, y)$ with $x$ and $y$ positive integers, then $d^{1/n}$ must
be $n$-well-approximable.

10.10 Fix $\alpha > 3$, let $F$ be the set of $(x, y)$ in $\mathbb{R}^2$ such that $\|qx\| \leq q^{1-\alpha}$ and $\|qy\| \leq q^{1-\alpha}$
are satisfied simultaneously for infinitely many positive integers $q$. Show, in a
similar way to the first part of the proof of Theorem 10.3, that $\dim_H F \leq 3/\alpha$. (In
fact, it may be shown, using a generalization of the remainder of the proof, that
$\dim_H F = 3/\alpha$.)

10.11 Show that the set of real numbers $x$, such that $(x + m)^2$ is $\alpha$-well-approximable
for all integers $m$, has Hausdorff dimension $2/\alpha$.



Chapter 11 Graphs of functions



A variety of interesting fractals, both of theoretical and practical importance, 
occur as graphs of functions. Indeed, many phenomena display fractal features 
when plotted as functions of time. Examples include wind speed, levels of reser- 
voirs, population data and prices on the stock market, at least when recorded 
over fairly long time spans. 

11.1 Dimensions of graphs 

We consider functions $f : [a, b] \to \mathbb{R}$. Under certain circumstances the graph

\[ \operatorname{graph} f = \{(t, f(t)) : a \leq t \leq b\} \]

regarded as a subset of the $(t, x)$-coordinate plane may be a fractal. (Note that
we work with coordinates $(t, x)$ rather than $(x, y)$ for consistency with the rest
of the book, and because the independent variable is frequently time.) If $f$ has
a continuous derivative, then it is not difficult to see that graph $f$ has dimension
1 and, indeed, is a regular 1-set, see Section 5.1. The same is true if $f$ is of
bounded variation; that is, if $\sum_{i=0}^{m-1} |f(t_i) - f(t_{i+1})| \leq$ constant for all dissections
$0 = t_0 < t_1 < \cdots < t_m = 1$. However, it is possible for a continuous function to
be sufficiently irregular to have a graph of dimension strictly greater than 1.
Perhaps the best known example is

\[ f(t) = \sum_{k=1}^{\infty} \lambda^{(s-2)k} \sin(\lambda^k t) \]

where $1 < s < 2$ and $\lambda > 1$. This function, essentially Weierstrass's example of
a continuous function that is nowhere differentiable, has box dimension $s$, and is
believed to have Hausdorff dimension $s$.

We first derive some simple but widely applicable estimates for the box
dimension of graphs. Given a function $f$ and an interval $[t_1, t_2]$, we write $R_f$ for the
maximum range of $f$ over an interval,

\[ R_f[t_1, t_2] = \sup_{t_1 \leq t, u \leq t_2} |f(t) - f(u)|. \]

Proposition 11.1

Let $f : [0, 1] \to \mathbb{R}$ be continuous. Suppose that $0 < \delta < 1$, and $m$ is the least
integer greater than or equal to $1/\delta$. Then, if $N_\delta$ is the number of squares of the
$\delta$-mesh that intersect graph $f$,

\[ \delta^{-1} \sum_{i=0}^{m-1} R_f[i\delta, (i+1)\delta] \leq N_\delta \leq 2m + \delta^{-1} \sum_{i=0}^{m-1} R_f[i\delta, (i+1)\delta]. \tag{11.1} \]

Proof. The number of mesh squares of side $\delta$ in the column above the interval
$[i\delta, (i+1)\delta]$ that intersect graph $f$ is at least $R_f[i\delta, (i+1)\delta]/\delta$ and at most $2 +
R_f[i\delta, (i+1)\delta]/\delta$, using that $f$ is continuous. Summing over all such intervals
gives (11.1). This is illustrated in figure 11.1. □
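
The two sides of (11.1) are straightforward to approximate on a computer. The sketch below is illustrative only: the function name `mesh_count_bounds` is hypothetical, and each range $R_f[i\delta, (i+1)\delta]$ is approximated by sampling $f$ at finitely many points, which is an assumption that can only underestimate the true range.

```python
import math

def mesh_count_bounds(f, delta, samples_per_column=200):
    """Approximate lower and upper bounds of (11.1) for N_delta on [0, 1].

    Each range R_f over a column is estimated from sampled values of f.
    """
    m = math.ceil(1.0 / delta)            # least integer >= 1/delta
    total_range = 0.0
    for i in range(m):
        left = i * delta
        right = min((i + 1) * delta, 1.0)  # stay inside the domain [0, 1]
        values = [f(left + (right - left) * j / samples_per_column)
                  for j in range(samples_per_column + 1)]
        total_range += max(values) - min(values)
    lower = total_range / delta
    upper = 2 * m + total_range / delta
    return lower, upper

# For f(t) = t and delta = 0.1 the ranges sum to 1, giving bounds 10 and 30.
print(mesh_count_bounds(lambda t: t, 0.1))
```

For the diagonal $f(t) = t$ the true count of mesh squares meeting the graph lies between these two values, as the proposition asserts.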

This proposition may be applied immediately to functions satisfying a
Hölder condition.



Corollary 11.2

Let $f : [0, 1] \to \mathbb{R}$ be a continuous function.

(a) Suppose

\[ |f(t) - f(u)| \leq c|t - u|^{2-s} \quad (0 \leq t, u \leq 1) \tag{11.2} \]

where $c > 0$ and $1 \leq s \leq 2$. Then $\mathcal{H}^s(\operatorname{graph} f) < \infty$ and $\dim_H \operatorname{graph} f \leq
\underline{\dim}_B \operatorname{graph} f \leq \overline{\dim}_B \operatorname{graph} f \leq s$. This remains true provided (11.2) holds
when $|t - u| < \delta$ for some $\delta > 0$.

(b) Suppose that there are numbers $c > 0$, $\delta_0 > 0$ and $1 \leq s < 2$ with the
following property: for each $t \in [0, 1]$ and $0 < \delta \leq \delta_0$ there exists $u$ such
that $|t - u| \leq \delta$ and

\[ |f(t) - f(u)| \geq c\delta^{2-s}. \tag{11.3} \]

Then $s \leq \underline{\dim}_B \operatorname{graph} f$.

Figure 11.1 The number of $\delta$-mesh squares in a column above an interval of width $\delta$
that intersect graph $f$ is approximately the range of $f$ over that interval divided by $\delta$.
Summing these numbers gives estimates for the box dimension of graph $f$

Proof

(a) It is immediate from (11.2) that $R_f[t_1, t_2] \leq c|t_1 - t_2|^{2-s}$ for $0 \leq t_1, t_2 \leq 1$.
With notation as in Proposition 11.1, $m < (1 + \delta^{-1})$ so by (11.1)

\[ N_\delta \leq 2m + \delta^{-1} m c \delta^{2-s} \leq (1 + \delta^{-1})(2 + c\delta^{-1}\delta^{2-s}) \leq c_1 \delta^{-s} \]

where $c_1$ is independent of $\delta$. The result now follows from Proposition 4.1.

(b) In the same way, (11.3) implies that $R_f[t_1, t_2] \geq c|t_1 - t_2|^{2-s}$. Since $\delta^{-1} \leq
m$, we have from (11.1) that

\[ N_\delta \geq \delta^{-1} m c \delta^{2-s} \geq \delta^{-1}\delta^{-1} c \delta^{2-s} = c\delta^{-s} \]

so Equivalent definition 3.1(iii) gives $s \leq \underline{\dim}_B \operatorname{graph} f$. □

Unfortunately, lower bounds for the Hausdorff dimension of graphs are gen- 
erally very much more awkward to find than box dimensions. 

Example 11.3. The Weierstrass function

Fix $\lambda > 1$ and $1 < s < 2$. Define $f : [0, 1] \to \mathbb{R}$ by

\[ f(t) = \sum_{k=1}^{\infty} \lambda^{(s-2)k} \sin(\lambda^k t). \tag{11.4} \]

Then, provided $\lambda$ is large enough, $\dim_B \operatorname{graph} f = s$.

Calculation. Given $0 < h < \lambda^{-1}$, let $N$ be the integer such that

\[ \lambda^{-(N+1)} \leq h < \lambda^{-N}. \tag{11.5} \]

Then

\[ |f(t+h) - f(t)| \leq \sum_{k=1}^{N} \lambda^{(s-2)k} |\sin(\lambda^k(t+h)) - \sin(\lambda^k t)| + \sum_{k=N+1}^{\infty} \lambda^{(s-2)k} |\sin(\lambda^k(t+h)) - \sin(\lambda^k t)| \]

\[ \leq \sum_{k=1}^{N} \lambda^{(s-2)k} \lambda^k h + \sum_{k=N+1}^{\infty} 2\lambda^{(s-2)k} \]

using that $|\sin u - \sin v| \leq |u - v|$ (a consequence of the mean-value theorem)
on the first $N$ terms of the sum, and that $|\sin u| \leq 1$ on the remaining terms.
Summing these geometric series,

\[ |f(t+h) - f(t)| \leq \frac{h\lambda^{(s-1)N}}{1 - \lambda^{1-s}} + \frac{2\lambda^{(s-2)(N+1)}}{1 - \lambda^{s-2}} \leq ch^{2-s} \]

where $c$ is independent of $h$, using (11.5). Corollary 11.2(a) now gives that
$\overline{\dim}_B \operatorname{graph} f \leq s$.

In the same way, but splitting the sum into three parts (the first $N - 1$ terms,
the $N$th term, and the rest) we get that

\[ |f(t+h) - f(t) - \lambda^{(s-2)N}(\sin \lambda^N(t+h) - \sin \lambda^N t)| \leq \frac{\lambda^{(s-2)N - s + 1}}{1 - \lambda^{1-s}} + \frac{2\lambda^{(s-2)(N+1)}}{1 - \lambda^{s-2}} \tag{11.6} \]

if $\lambda^{-(N+1)} \leq h < \lambda^{-N}$.

Suppose $\lambda > 2$ is large enough for the right-hand side of (11.6) to be less than
$\tfrac{1}{20}\lambda^{(s-2)N}$ for all $N$. For $\delta < \lambda^{-1}$, take $N$ such that $\lambda^{-N} \leq \delta < \lambda^{-(N-1)}$. For each
$t$, we may choose $h$ with $\lambda^{-(N+1)} \leq h < \lambda^{-N} \leq \delta$ such that $|\sin \lambda^N(t+h) -
\sin \lambda^N t| > \tfrac{1}{10}$, so by (11.6)

\[ |f(t+h) - f(t)| \geq \tfrac{1}{10}\lambda^{(s-2)N} - \tfrac{1}{20}\lambda^{(s-2)N} = \tfrac{1}{20}\lambda^{(s-2)N} \geq \tfrac{1}{20}\lambda^{s-2}\delta^{2-s}. \]

It follows from Corollary 11.2(b) that $\underline{\dim}_B \operatorname{graph} f \geq s$. □

Various cases of the Weierstrass function are shown in figure 1 1 .2. 
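
The box dimension estimate can be tried out numerically. The following Python sketch is an illustration under stated assumptions, not a rigorous computation: the series (11.4) is truncated after a fixed number of terms, the values $\lambda = 2$ and $s = 1.5$ are arbitrary choices, the range over each column is estimated by sampling, and the dimension is read off from the growth of the column sums of Proposition 11.1 between two mesh sizes. All function names are hypothetical.

```python
import math

def weierstrass(t, lam=2.0, s=1.5, terms=30):
    """Partial sum of (11.4): sum over k = 1..terms of lam^((s-2)k) sin(lam^k t)."""
    return sum(lam ** ((s - 2) * k) * math.sin(lam ** k * t)
               for k in range(1, terms + 1))

def column_count(f, delta, samples=32):
    """Approximate N_delta by (1/delta) times the sum of sampled column ranges."""
    m = math.ceil(1.0 / delta)
    total = 0.0
    for i in range(m):
        left, right = i * delta, min((i + 1) * delta, 1.0)
        vals = [f(left + (right - left) * j / samples) for j in range(samples + 1)]
        total += max(vals) - min(vals)
    return total / delta

def box_dimension_estimate(f, delta_coarse, delta_fine):
    """Slope of log N_delta against -log delta between two mesh sizes."""
    n_coarse = column_count(f, delta_coarse)
    n_fine = column_count(f, delta_fine)
    return math.log(n_fine / n_coarse) / math.log(delta_coarse / delta_fine)

estimate = box_dimension_estimate(weierstrass, 2.0 ** -5, 2.0 ** -8)
print(estimate)
```

The printed value is only a rough indication of the dimension, since just two mesh sizes are used; estimates of this kind converge slowly and depend on the range of scales chosen.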

It is immediate from the above estimate that the Hausdorff dimension of the
graph of the Weierstrass function (11.4) is at most $s$. It is widely believed
that it equals $s$, at least for 'most' values of $\lambda$. This has not yet been proved
rigorously; it could be that there are coverings of the graph of the function
by sets of widely varying sizes that give a smaller value. Even to show that
$\dim_H \operatorname{graph} f > 1$ is not trivial. The known lower bounds come from mass
distribution methods depending on estimates for $\mathcal{L}\{t : (t, f(t)) \in B\}$ where $B$ is a
disc and $\mathcal{L}$ is Lebesgue measure. The rapid small-scale oscillation of $f$ ensures
that the graph is inside $B$ relatively rarely, so that this measure is small. In this
way it is possible to show that there is a constant $c$ such that

\[ s \geq \dim_H \operatorname{graph} f \geq s - c/\log \lambda \]

so when $\lambda$ is large the Hausdorff dimension cannot be much less than the
conjectured value.

Figure 11.2 The Weierstrass function $f(t) = \sum_k \lambda^{(s-2)k} \sin(\lambda^k t)$ with $\lambda = 1.5$ and (a)
$s = 1.1$, (b) $s = 1.3$, (c) $s = 1.5$, (d) $s = 1.7$

The Weierstrass function (11.4) is representative of a much wider class of
functions to which these methods apply. If $g$ is a suitable periodic function, a
similar method can often show that

\[ f(t) = \sum_{k=1}^{\infty} \lambda^{(s-2)k} g(\lambda^k t) \tag{11.7} \]

has $\dim_B \operatorname{graph} f = s$. At first such functions seem rather contrived, but their
occurrence as repellers in certain dynamical systems (see Exercise 13.9) gives
them a new significance.

In Section 9.4 we saw that self-affine sets defined by iterated function systems
are often fractals; by a suitable choice of affine transformations, they can also
be graphs of functions. Let $\{S_1, \ldots, S_m\}$ be affine transformations represented in
matrix notation with respect to $(t, x)$ coordinates by

\[ S_i \begin{pmatrix} t \\ x \end{pmatrix} = \begin{pmatrix} 1/m & 0 \\ a_i & c_i \end{pmatrix} \begin{pmatrix} t \\ x \end{pmatrix} + \begin{pmatrix} (i-1)/m \\ b_i \end{pmatrix}, \tag{11.8} \]

that is

\[ S_i(t, x) = (t/m + (i-1)/m,\ a_i t + c_i x + b_i). \]

Thus the $S_i$ transform vertical lines to vertical lines, with the vertical strip $0 \leq
t \leq 1$ mapped onto the strip $(i-1)/m \leq t \leq i/m$. We suppose that

\[ 1/m < c_i < 1 \tag{11.9} \]

so that contraction in the $t$ direction is stronger than in the $x$ direction.

Let $p_1 = (0, b_1/(1 - c_1))$ and $p_m = (1, (a_m + b_m)/(1 - c_m))$ be the fixed
points of $S_1$ and $S_m$. We assume that the matrix entries have been chosen so that

\[ S_i(p_m) = S_{i+1}(p_1) \quad (1 \leq i \leq m - 1) \tag{11.10} \]

so that the segments $[S_i(p_1), S_i(p_m)]$ join up to form a polygonal curve $E_1$. To
avoid trivial cases, we assume that the points $S_1(p_1), \ldots, S_m(p_1), p_m$ are not all
collinear. The attractor $F$ of the iterated function system $\{S_1, \ldots, S_m\}$ (see (9.2))
may be constructed by repeatedly replacing line segments by affine images of
the 'generator' $E_1$; see figures 11.3 and 11.4. Condition (11.10) ensures that the
segments join up with the result that $F$ is the graph of some continuous function
$f : [0, 1] \to \mathbb{R}$. Note that these conditions do not necessarily imply that the $S_i$
are contractions with respect to Euclidean distance. However, it is possible to
redefine distance in the $(t, x)$ plane in such a way that the $S_i$ become contractions,
and in this context the IFS theory guarantees a unique attractor.



Example 11.4. Self-affine curves

Let $F = \operatorname{graph} f$ be the self-affine curve described above. Then $\dim_B F =
1 + \log(c_1 + \cdots + c_m)/\log m$.

Figure 11.3 Stages in the construction of a self-affine curve $F$. The affine
transformations $S_1$ and $S_2$ map the generating triangle $p_1 p p_2$ onto the triangles $p_1 q_1 p$ and $p q_2 p_2$,
respectively, and transform vertical lines to vertical lines. The rising sequence of
polygonal curves $E_0, E_1, \ldots$ are given by $E_{k+1} = S_1(E_k) \cup S_2(E_k)$ and provide increasingly
good approximations to $F$ (shown in figure 11.4(a) for this case)

Calculation. Let $T_i$ be the 'linear part' of $S_i$, represented by the matrix

\[ \begin{pmatrix} 1/m & 0 \\ a_i & c_i \end{pmatrix}. \]

Let $I_{i_1, \ldots, i_k}$ be the interval of the $t$-axis consisting of those $t$ with base-$m$ expansion
beginning $0.i'_1 \cdots i'_k$ where $i'_j = i_j - 1$. Then the part of $F$ above $I_{i_1, \ldots, i_k}$ is the
affine image $S_{i_1} \circ \cdots \circ S_{i_k}(F)$, which is a translate of $T_{i_1} \circ \cdots \circ T_{i_k}(F)$. The matrix
representing $T_{i_1} \circ \cdots \circ T_{i_k}$ is easily seen by induction to be

\[ \begin{pmatrix} m^{-k} & 0 \\ m^{1-k} a_{i_1} + m^{2-k} c_{i_1} a_{i_2} + \cdots + c_{i_1} c_{i_2} \cdots c_{i_{k-1}} a_{i_k} & c_{i_1} c_{i_2} \cdots c_{i_k} \end{pmatrix}. \]

This is a shear transformation, contracting vertical lines by a factor $c_{i_1} c_{i_2} \cdots c_{i_k}$.
Observe that the bottom left-hand entry is bounded by

\[ |m^{1-k} a + m^{2-k} c_{i_1} a + \cdots + c_{i_1} \cdots c_{i_{k-1}} a| \leq ((mc)^{1-k} + (mc)^{2-k} + \cdots + 1)\, c_{i_1} \cdots c_{i_{k-1}} a \leq r c_{i_1} \cdots c_{i_{k-1}} \]

where $a = \max |a_i|$, $c = \min\{c_i\} > 1/m$ and $r = a/(1 - (mc)^{-1})$ from summing
the geometric series. Thus the image $T_{i_1} \circ \cdots \circ T_{i_k}(F)$ is contained in a rectangle
of height $(r + h) c_{i_1} \cdots c_{i_k}$ where $h$ is the height of $F$. On the other hand, if
$q_1, q_2, q_3$ are three non-collinear points chosen from $S_1(p_1), \ldots, S_m(p_1), p_m$,
then $T_{i_1} \circ \cdots \circ T_{i_k}(F)$ contains the points $T_{i_1} \circ \cdots \circ T_{i_k}(q_j)$ ($j = 1, 2, 3$). The height
of the triangle with these vertices is at least $c_{i_1} \cdots c_{i_k} d$ where $d$ is the vertical
distance from $q_2$ to the segment $[q_1, q_3]$. Thus the range of the function $f$ over
$I_{i_1, \ldots, i_k}$ satisfies

\[ d c_{i_1} \cdots c_{i_k} \leq R_f[I_{i_1, \ldots, i_k}] \leq r_1 c_{i_1} \cdots c_{i_k} \]

where $r_1 = r + h$.

Figure 11.4 Self-affine curves defined by the two affine transformations that map the
triangle $p_1 p p_2$ onto $p_1 q_1 p$ and $p q_2 p_2$ respectively. In (a) the vertical contraction of both
transformations is 0.7 giving $\dim_B \operatorname{graph} f = 1.49$, and in (b) the vertical contraction of
both transformations is 0.8, giving $\dim_B \operatorname{graph} f = 1.68$

For fixed $k$, we sum this over the $m^k$ intervals $I_{i_1, \ldots, i_k}$ of lengths $m^{-k}$ to get,
using Proposition 11.1,

\[ m^k d \sum c_{i_1} \cdots c_{i_k} \leq N_{m^{-k}}(F) \leq 2m^k + m^k r_1 \sum c_{i_1} \cdots c_{i_k} \]

where $N_{m^{-k}}(F)$ is the number of mesh squares of side $m^{-k}$ that intersect $F$. For
each $j$ the number $c_{i_j}$ ranges through the values $c_1, \ldots, c_m$, so that $\sum c_{i_1} \cdots c_{i_k} =
(c_1 + \cdots + c_m)^k$. Thus

\[ d m^k (c_1 + \cdots + c_m)^k \leq N_{m^{-k}}(F) \leq 2m^k + r_1 m^k (c_1 + \cdots + c_m)^k. \]

Taking logarithms and using Equivalent definition 3.1(iii) of box dimension gives
the value stated. □
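
As a quick numerical check of the formula in Example 11.4, the following Python sketch evaluates $1 + \log(c_1 + \cdots + c_m)/\log m$ for given vertical contraction ratios. The function name `self_affine_curve_dimension` is a hypothetical choice, and the sketch simply assumes that the remaining conditions of the example, such as (11.10), hold.

```python
import math

def self_affine_curve_dimension(vertical_contractions):
    """Box dimension 1 + log(c_1 + ... + c_m) / log m from Example 11.4.

    Only condition (11.9) is checked; the join-up condition (11.10) is assumed.
    """
    m = len(vertical_contractions)
    if m < 2 or any(not (1.0 / m < c < 1.0) for c in vertical_contractions):
        raise ValueError("need m >= 2 and 1/m < c_i < 1, as in (11.9)")
    return 1.0 + math.log(sum(vertical_contractions)) / math.log(m)

# The two curves of figure 11.4: vertical contractions 0.7 and 0.8 with m = 2.
print(round(self_affine_curve_dimension([0.7, 0.7]), 2))  # 1.49
print(round(self_affine_curve_dimension([0.8, 0.8]), 2))  # 1.68
```

These values agree with the dimensions 1.49 and 1.68 quoted in the caption of figure 11.4.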

Self-affine functions are useful for fractal interpolation. Suppose we wish to
find a fractal curve of a given dimension passing through the points $(i/m, x_i)$ for
$i = 0, 1, \ldots, m$. By choosing transformations (11.8) in such a way that $S_i$ maps
the segment $[p_1, p_m]$ onto the segment $[((i-1)/m, x_{i-1}), (i/m, x_i)]$ for each $i$,
the construction described above gives a self-affine function with graph passing
through the given points. By adjusting the values of the matrix entries we can
ensure that the curve has the required box dimension; there is also some freedom
to vary the appearance of the curve in other ways. Fractal interpolation has been
used very effectively to picture mountain skylines.

Of course, self-affine functions can be generalized so that the $S_i$ do not all have
the same contraction ratio in the $t$ direction. This leads to fractal interpolation of
points that are not equally spaced. With additional effort, the box dimension of
such curves may be found.

One example of fractal interpolation is illustrated in figure 11.5. 



*11.2 Autocorrelation of fractal functions 

As we have remarked, quantities varying with time often turn out to have fractal 
graphs. One way in which their fractal nature is often manifested is by a power- 
law behaviour of the correlation between measurements separated by time h . In 
this section we merely outline the ideas involved; we make no attempt to be 
rigorous. In particular, the limits used are all assumed to exist. 

Figure 11.5 Fractal interpolation on the northern and southern halves of a map of
South-West England, using the vertices of the polygon in figure (a) as data points.
The dimensions of the self-affine curves fitted to these data points are (b) 1.1,
(c) 1.2 and (d) 1.3

For convenience of analysis, we assume that $f : \mathbb{R} \to \mathbb{R}$ is a continuous
bounded function and we consider the average behaviour of $f$ over long periods
$[-T, T]$. (Similar ideas hold if $f$ is just defined on $[0, \infty)$, or on a finite interval,
by extending $f$ to $\mathbb{R}$ in a periodic manner.) We write $\bar{f}$ for the average value of
$f$, i.e.

$$\bar{f} = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} f(t)\,dt.$$



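A measure of the correlation between $f$ at times separated by $h$ is provided by
the autocorrelation function

$$C(h) = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} (f(t+h) - \bar{f})(f(t) - \bar{f})\,dt \tag{11.11}$$

$$= \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} f(t+h) f(t)\,dt - (\bar{f})^2. \tag{11.12}$$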



From (11.11) we see that $C(h)$ is positive if $f(t+h) - \bar{f}$ and $f(t) - \bar{f}$ tend to
have the same sign, and is negative if they tend to have opposite signs. If there
is no correlation, $C(h) = 0$. Since

$$\int (f(t+h) - f(t))^2\,dt = \int f(t+h)^2\,dt + \int f(t)^2\,dt - 2\int f(t+h) f(t)\,dt$$

we have

$$C(h) = \overline{f^2} - (\bar{f})^2 - \frac{1}{2} \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} (f(t+h) - f(t))^2\,dt$$

$$= C(0) - \frac{1}{2} \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} (f(t+h) - f(t))^2\,dt \tag{11.13}$$

where

$$\overline{f^2} = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} f(t)^2\,dt$$

is the mean square of $f$, assumed to be positive and finite. With $C(h)$ in the form
(11.13) we can infer a plausible relationship between the autocorrelation function
of $f$ and the dimension of graph $f$. The clue is in Corollary 11.2. Suppose that $f$
is a function satisfying (11.2) and also satisfying (11.3) in a 'reasonably uniform
way'. Then there are constants $c_1$ and $c_2$ such that

$$c_1 h^{4-2s} \le \frac{1}{2T} \int_{-T}^{T} (f(t+h) - f(t))^2\,dt \le c_2 h^{4-2s} \tag{11.14}$$

for small $h$. Obviously, this is not directly equivalent to (11.2) and (11.3), but in
many reasonably 'time-homogeneous' situations, the conditions do correspond.
Thus if the autocorrelation function of $f$ satisfies

$$C(0) - C(h) \sim c h^{4-2s}$$

for small $h$, it is reasonable to expect the box dimension of graph $f$ to equal $s$.
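As a numerical illustration of (11.13), the following Python sketch checks the identity $C(h) = C(0) - \frac{1}{2}\,\overline{(f(t+h) - f(t))^2}$ on one period of a sampled periodic signal, where the long-time averages become exact averages over the period. It is a sketch under assumptions, not part of the text: the sample signal and the function names `autocorrelation` and `mean_square_increment` are illustrative choices.

```python
import math

def mean(values):
    return sum(values) / len(values)

def autocorrelation(samples, lag):
    """Discrete analogue of (11.12), averaging over one period (indices wrap around)."""
    n = len(samples)
    fbar = mean(samples)
    products = [samples[(j + lag) % n] * samples[j] for j in range(n)]
    return mean(products) - fbar ** 2

def mean_square_increment(samples, lag):
    """Average of (f(t+h) - f(t))^2 over one period."""
    n = len(samples)
    return mean([(samples[(j + lag) % n] - samples[j]) ** 2 for j in range(n)])

# Hypothetical test signal: one period of a two-frequency wave with non-zero mean.
n = 64
signal = [math.sin(2 * math.pi * j / n) + 0.5 * math.cos(6 * math.pi * j / n) + 0.3
          for j in range(n)]

for lag in range(5):
    lhs = autocorrelation(signal, lag)
    rhs = autocorrelation(signal, 0) - 0.5 * mean_square_increment(signal, lag)
    print(lag, lhs, rhs)  # the two columns agree up to rounding
```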
The autocorrelation function is closely connected with the power spectrum of
$f$, defined by

$$S(\omega) = \lim_{T \to \infty} \frac{1}{2T} \left| \int_{-T}^{T} f(t) e^{it\omega}\,dt \right|^2. \tag{11.15}$$

For functions with any degree of long-term regularity, $S(\omega)$ is likely to exist.
The power spectrum reflects the strength of the frequency $\omega$ in the harmonic
decomposition of $f$.

We show that the power spectrum is the Fourier transform of the autocorrelation
function. By working with $f(t) - \bar{f}$ we may assume that $f$ has zero mean. Let
$f_T(t)$ be given by $f(t)$ if $|t| \le T$ and be 0 otherwise, and define

$$C_T(h) = \frac{1}{2T} \int_{-\infty}^{\infty} f_T(t+h) f_T(t)\,dt = \frac{1}{2T}\, f_T^{-} * f_T(-h)$$

where $f_T^{-}(t) = f_T(-t)$ and $*$ denotes convolution. By the convolution theorem
for Fourier transforms (see Section 4.4) this equation transforms to

$$\hat{C}_T(\omega) = \frac{1}{2T}\, \hat{f}_T^{-}(\omega) \hat{f}_T(\omega) = \frac{1}{2T}\, |\hat{f}_T(\omega)|^2$$

where $\hat{C}_T(\omega) = \int_{-\infty}^{\infty} C_T(t) e^{it\omega}\,dt$ and $\hat{f}_T(\omega) = \int_{-\infty}^{\infty} f_T(t) e^{it\omega}\,dt$ are the usual
Fourier transforms. (Note that we cannot work with the transform of $f$ itself,
since the integral would diverge.) Letting $T \to \infty$ we see that $C_T(h) \to C(h)$
for each $h$ and $\hat{C}_T(\omega) \to S(\omega)$ for each $\omega$. It may be shown that this implies that

$$\hat{C}(\omega) = S(\omega).$$

Clearly $S$ and $C$ are both real and even functions, so the transforms are cosine
transforms. Thus

$$S(\omega) = \int_{-\infty}^{\infty} C(t) e^{it\omega}\,dt = \int_{-\infty}^{\infty} C(t) \cos(\omega t)\,dt \tag{11.16}$$

and, by the inversion formula for Fourier transforms,

$$C(h) = \frac{1}{2\pi} \int_{-\infty}^{\infty} S(\omega) e^{-ih\omega}\,d\omega = \frac{1}{2\pi} \int_{-\infty}^{\infty} S(\omega) \cos(\omega h)\,d\omega. \tag{11.17}$$

In this analysis we have not gone into questions of convergence of the integrals
too carefully, but in most practical situations the argument can be justified.
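A finite, discrete analogue of this argument can be checked directly. The Python sketch below is an illustration under assumptions rather than the limiting statement itself: it takes one period of a zero-mean sampled signal, forms the circular autocorrelation $C(h) = \frac{1}{N}\sum_j x_{j+h} x_j$, and compares its discrete Fourier transform with $\frac{1}{N}|\hat{x}|^2$, using the same sign convention $e^{it\omega}$ as the text. The helper names are hypothetical and the naive $O(N^2)$ transform is used only to keep the example self-contained.

```python
import cmath
import math

def circular_autocorrelation(x):
    """C(h) = (1/N) * sum_j x[j+h] * x[j], indices taken modulo N."""
    n = len(x)
    return [sum(x[(j + h) % n] * x[j] for j in range(n)) / n for h in range(n)]

def dft(values):
    """Naive discrete Fourier transform with kernel exp(+2*pi*i*k*t/N)."""
    n = len(values)
    return [sum(v * cmath.exp(2j * math.pi * k * t / n) for t, v in enumerate(values))
            for k in range(n)]

def power_spectrum(x):
    """Discrete analogue of (11.15): |transform|^2 divided by the record length."""
    n = len(x)
    return [abs(c) ** 2 / n for c in dft(x)]

# Hypothetical zero-mean test signal sampled over one period.
n = 16
x = [math.sin(2 * math.pi * j / n) + 0.5 * math.cos(2 * math.pi * 3 * j / n)
     for j in range(n)]

spectrum = power_spectrum(x)
transform_of_c = dft(circular_autocorrelation(x))
max_gap = max(abs(a - b) for a, b in zip(transform_of_c, spectrum))  # rounding error only
```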

Autocorrelations provide us with several methods of estimating the dimension
of the graph of a function or 'signal' $f$. We may compute the autocorrelation
function $C(h)$ or equivalently, the mean-square change in signal in time $h$ over
a long period, so from (11.13)

$$2[C(0) - C(h)] \simeq \frac{1}{2T} \int_{-T}^{T} (f(t+h) - f(t))^2\,dt. \tag{11.18}$$

If the power-law behaviour

$$C(0) - C(h) \sim c h^{4-2s} \tag{11.19}$$

is observed for small $h$, we might expect the box dimension of graph $f$ to be $s$.
In other words,

$$\dim_B \operatorname{graph} f = 2 - \lim_{h \to 0} \frac{\log(C(0) - C(h))}{2 \log h} \tag{11.20}$$

if this limit exists. We might then seek functions with graphs known to have
this dimension, such as those of Examples 11.3 and 11.4 or the fractional
Brownian functions of Section 16.2 to provide simulations of signals with similar
characteristics.
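In practice (11.18) to (11.20) suggest a simple estimator: compute the mean-square increment at several small lags and fit a straight line to its logarithm against $\log h$; if the slope is $4 - 2s$ then $s = 2 - \text{slope}/2$. The Python sketch below is one possible implementation under assumptions (equally spaced samples, an ordinary least-squares fit, hypothetical function names); it is heuristic in the same sense as the discussion above.

```python
import math

def mean_square_change(samples, lag):
    """Average of (f(t+h) - f(t))^2 over the record, with h = lag * dt."""
    diffs = [(samples[j + lag] - samples[j]) ** 2 for j in range(len(samples) - lag)]
    return sum(diffs) / len(diffs)

def fit_slope(xs, ys):
    """Least-squares slope of ys against xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx

def estimate_graph_dimension(samples, dt, lags):
    """Estimate s from the power law 2[C(0) - C(h)] ~ h^(4 - 2s)."""
    log_h = [math.log(lag * dt) for lag in lags]
    log_v = [math.log(mean_square_change(samples, lag)) for lag in lags]
    return 2.0 - fit_slope(log_h, log_v) / 2.0

# Sanity check on a smooth signal f(t) = t, whose graph has dimension 1.
ramp = [0.01 * j for j in range(1000)]
s_ramp = estimate_graph_dimension(ramp, 0.01, [1, 2, 4, 8, 16])
```

For a smooth signal the increments scale like $h^2$, so the estimate returns 1; rougher signals give slopes below 2 and estimates between 1 and 2.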

Alternatively, we can work from the power spectrum $S(\omega)$ and use (11.17) to
find the autocorrelation function. We need to know about $C(0) - C(h)$ for small
$h$; typically this depends on the behaviour of its transform $S(\omega)$ when $\omega$ is large.
The situation of greatest interest is when the power spectrum obeys a power law
$S(\omega) \sim c/\omega^{\alpha}$ for large $\omega$, in which case

$$C(0) - C(h) \sim b h^{\alpha - 1} \tag{11.21}$$

for small $h$, for some constant $b$. To see this formally note that from (11.17)

$$\pi (C(0) - C(h)) = \int_0^{\infty} S(\omega)(1 - \cos(\omega h))\,d\omega = 2 \int_0^{\infty} S(\omega) \sin^2(\tfrac{1}{2}\omega h)\,d\omega$$

and taking $S(\omega) = \omega^{-\alpha}$ gives

$$\tfrac{1}{2}\pi (C(0) - C(h)) = \int_0^{\infty} \omega^{-\alpha} \sin^2(\tfrac{1}{2}\omega h)\,d\omega = h^{\alpha - 1} \int_0^{\infty} u^{-\alpha} \sin^2(\tfrac{1}{2}u)\,du$$

having substituted $u = \omega h$. It may be shown that (11.21) also holds if $S$ is
any sufficiently smooth function such that $S(\omega) \sim c\omega^{-\alpha}$ as $\omega \to \infty$. Comparing
(11.19) and (11.21) suggests that graph $f$ has box dimension $s$ where $4 - 2s =
\alpha - 1$, or $s = \frac{1}{2}(5 - \alpha)$. Thus it is reasonable to expect a signal with a $1/\omega^{\alpha}$
power spectrum to have a graph of dimension $\frac{1}{2}(5 - \alpha)$ for $1 < \alpha < 3$.

In practice, curves of dimension $\frac{1}{2}(5 - \alpha)$ often provide good simulations and
display similar characteristics to signals observed to have $1/\omega^{\alpha}$ power spectra.
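The heuristic $s = \frac{1}{2}(5 - \alpha)$ is easy to experiment with. The Python sketch below is a hedged illustration, not a method from the text: it converts a spectral exponent into the expected graph dimension, and synthesizes a test signal as a finite sum of cosines with amplitudes $k^{-\alpha/2}$ (so the power at frequency $k$ is proportional to $k^{-\alpha}$) and random phases. The truncation to finitely many terms, the random-phase model and the function names are all assumptions made for the example; a truncated sum is smooth, so it can only mimic fractal behaviour over a limited range of scales.

```python
import math
import random

def graph_dimension_from_spectrum(alpha):
    """Expected box dimension (5 - alpha)/2 for a 1/omega^alpha spectrum, 1 < alpha < 3."""
    if not 1.0 < alpha < 3.0:
        raise ValueError("the heuristic applies only for 1 < alpha < 3")
    return (5.0 - alpha) / 2.0

def synthesize_signal(alpha, n_terms, times, seed=0):
    """Sum of cosines with power proportional to k^(-alpha) and random phases."""
    rng = random.Random(seed)
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_terms)]
    return [sum(k ** (-alpha / 2.0) * math.cos(k * t + phases[k - 1])
                for k in range(1, n_terms + 1))
            for t in times]

times = [2.0 * math.pi * j / 256 for j in range(256)]
signal = synthesize_signal(2.0, 100, times)        # a 1/omega^2 spectrum
expected_dim = graph_dimension_from_spectrum(2.0)  # 1.5
```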



11.3 Notes and references 

The dimension of fractal graphs was first studied by Besicovitch and Ursell 
(1937). For more recent work on Weierstrass-type curves see Berry and Lewis 
(1980), Mauldin and Williams (1986b) and Hunt (1998). Self-affine curves are
the theme of a book by Massopust (1994) and are also discussed in
Bedford (1989). The theory of autocorrelation functions is given in most books
on time series analysis, for example Papoulis (1962). 



Exercises 

11.1 Verify that if $f : [0, 1] \to \mathbb{R}$ has a continuous derivative then graph $f$ is a regular
1-set, see Section 5.1.

11.2 Let $f, g : [0, 1] \to \mathbb{R}$ be continuous functions and define the sum function $f + g$
by $(f + g)(t) = f(t) + g(t)$. Suppose that $f$ is a Lipschitz function. By setting
up a Lipschitz mapping between graph $(f + g)$ and graph $g$, show that
$\dim_H \operatorname{graph}(f + g) = \dim_H \operatorname{graph} g$, with a similar result for box dimensions.

11.3 Let $f, g : [0, 1] \to \mathbb{R}$ be continuous functions such that the box dimensions of
their graphs exist. Use Proposition 11.1 to show that $\dim_B \operatorname{graph}(f + g)$ equals
the greater of $\dim_B \operatorname{graph} f$ and $\dim_B \operatorname{graph} g$, provided that these dimensions are
unequal. Give an example to show that this proviso is necessary.

11.4 Show that any function satisfying the conditions of Corollary 11.2(b) with 1 < s < 
2 must be nowhere differentiable. Deduce that the Weierstrass function of Example 
11.3 and the self-affine curves of Example 11.4, are nowhere differentiable. 

11.5 For $\lambda > 1$ and $1 < s < 2$ let $f : [0, 1] \to \mathbb{R}$ be a Weierstrass function modified to
include 'phases' $\theta_k$:

$$f(t) = \sum_{k=1}^{\infty} \lambda^{(s-2)k} \sin(\lambda^k t + \theta_k).$$

Show that $\dim_B \operatorname{graph} f = s$, provided that $\lambda$ is large enough.

11.6 Let $g : \mathbb{R} \to \mathbb{R}$ be the 'zig-zag' function of period 4 given by

$$g(t + 4k) = \begin{cases} t & (0 \le t < 1) \\ 2 - t & (1 \le t < 3) \\ t - 4 & (3 \le t < 4) \end{cases}$$

where $k$ is an integer and $0 \le t < 4$. Let $1 < s < 2$ and $\lambda > 1$ and let $f$
be given by

$$f(t) = \sum_{k=1}^{\infty} \lambda^{(s-2)k} g(\lambda^k t).$$

Show that $\dim_B \operatorname{graph} f = s$, provided that $\lambda$ is sufficiently large.

11.7 Suppose that the function $f : [0, 1] \to \mathbb{R}$ satisfies the Hölder condition (11.2). Let
$F$ be a subset of $[0, 1]$. Obtain an estimate for $\dim_H f(F)$ in terms of $\dim_H F$.

11.8 Let $f : [0, 1] \to \mathbb{R}$ be a function. Suppose that

$$\int_0^1 \int_0^1 \left[ |f(t) - f(u)|^2 + |t - u|^2 \right]^{-s/2} dt\,du < \infty$$

for some $s$ with $1 < s < 2$. Show, using Theorem 4.13, that $\dim_H \operatorname{graph} f \ge s$.

11.9 Let $D$ be the unit square $[0, 1] \times [0, 1]$ and let $f : D \to \mathbb{R}$ be a continuous function
such that

$$|f(x) - f(y)| \le c|x - y|^{3-s} \quad (x, y \in D).$$

Show that the surface $\{(x, f(x)) : x \in D\}$ has box dimension at most $s$. Similarly,
find a surface analogue to part (b) of Corollary 11.2.
11.10 Consider the affine maps Si(t, x) = {\t, \t + |*), S 2 (t,x) = (\t + \, —\t +
tX + j). Show that the attractor of {Si, 52} is the graph of a self-affine func- 
tion on [0, 1]. Sketch the first three stages in the construction of the graph, and 
find its box dimension. 

11.11 Answer the same question as 11.10 with the three affine maps S\(t, x) = {\t, \t + 
\x), S 2 (t, x) = (\t + |, -\t + \x + i), S 3 (t, x) = (\t + |, \t + \x - \). 

11.12 Estimate C(0) — C(h) where C(h) is the autocorrelation function of the Weierstrass 
function (11.4). 

11.13 Investigate the graphs of Weierstrass-type functions (11.7) using a computer.
Examine the effect of varying $s$ and $\lambda$, and experiment with various functions $g$.

11.14 Write a computer program to draw self-affine curves given by (11.8). Investigate
the effect of varying the values of the $c_i$.



Chapter 12 Examples from pure 
mathematics 



Fractal constructions have provided counterexamples, and sometimes solutions, 
to a variety of problems where more regular constructions have failed. In this 
chapter we look at several simple but elegant instances from differing areas of 
pure mathematics. 



12.1 Duality and the Kakeya problem 

The method of duality converts sets of points in the plane to sets of lines and 
may be used to create new fractals from old. The techniques can be applied to 
construct sets with particular properties; for example, to construct a plane set of 
zero area that contains a line running in every direction. 

For each point $(a, b)$ of $\mathbb{R}^2$, we let $L(a, b)$ denote the set of points on the
line $y = a + bx$, see figure 12.1. If $F$ is any subset of $\mathbb{R}^2$ we define the line
set $L(F)$ to be the union of the lines corresponding to the points of $F$, i.e.
$L(F) = \bigcup \{L(a, b) : (a, b) \in F\}$. Writing $L_c$ for the vertical line $x = c$, we have

$$L(a, b) \cap L_c = (c, a + bc) = (c, (a, b) \cdot (1, c)),$$

where '$\cdot$' is the usual scalar product in $\mathbb{R}^2$; thus for a subset $F$ of $\mathbb{R}^2$

$$L(F) \cap L_c = \{(c, (a, b) \cdot (1, c)) : (a, b) \in F\}.$$

Taking a scalar product with the vector $(1, c)$ may be interpreted geometrically
as projecting onto the line in the direction of $(1, c)$ and stretching by a factor
$(1 + c^2)^{1/2}$. Thus the set $L(F) \cap L_c$ is a similar copy of $\mathrm{proj}_\theta F$ scaled by
$(1 + c^2)^{1/2}$, where $\mathrm{proj}_\theta$ denotes orthogonal projection onto the line through the
origin at angle $\theta$ to the $x$-axis, where $c = \tan\theta$. In particular,

$$\dim_H (L(F) \cap L_c) = \dim_H (\mathrm{proj}_\theta F) \tag{12.1}$$
Figure 12.1 The duality principle: (a) the point $(a, b)$ corresponds to the line $y = a + bx$;
(b) the set $F$ corresponds to the line set $L(F)$; (c) the projection $\mathrm{proj}_\theta F$ is geometrically
similar to $L(F) \cap L_c$, where $c = \tan\theta$



and

$$\mathcal{L}(L(F) \cap L_c) = 0 \ \text{ if and only if } \ \mathcal{L}(\mathrm{proj}_\theta F) = 0 \tag{12.2}$$

where $\mathcal{L}$ denotes 1-dimensional Lebesgue measure, i.e. length. In this way, duality
relates the projections of $F$ (for which we have the theory of Chapter 6) to the
intersections of the line set $L(F)$ with vertical lines.
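The correspondence between vertical slices of the line set and projections can be verified numerically for a finite set of points. The Python sketch below is purely illustrative, with hypothetical function names and sample points: the slice of $L(F)$ at $x = c$ consists of the heights $a + bc$, and these coincide with the projections of $(a, b)$ onto the direction $\theta = \arctan c$ after scaling by $(1 + c^2)^{1/2}$.

```python
import math

def slice_heights(points, c):
    """Heights at which the lines y = a + b*x, (a, b) in points, meet the line x = c."""
    return [a + b * c for a, b in points]

def scaled_projection(points, c):
    """Projection onto the direction theta = atan(c), scaled by sqrt(1 + c^2)."""
    theta = math.atan(c)
    scale = math.sqrt(1.0 + c * c)
    return [scale * (a * math.cos(theta) + b * math.sin(theta)) for a, b in points]

# Hypothetical finite sample of a set F in the (a, b)-plane.
sample_f = [(0.0, 0.0), (0.25, 0.5), (0.75, -1.0), (1.0, 2.0)]
c = 0.7
gaps = [abs(u - v) for u, v in zip(slice_heights(sample_f, c), scaled_projection(sample_f, c))]
```

The entries of `gaps` vanish up to rounding, which is the similarity underlying (12.1) and (12.2).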

Projection onto the $y$-axis also has an interpretation. The gradient of the line
$L(a, b)$ is just $b = \mathrm{proj}_{\pi/2}(a, b)$, so, for any $F$, the set of gradients of the lines
in the line set $L(F)$ is given by $\mathrm{proj}_{\pi/2} F$.
If $F$ is a fractal its line set $L(F)$ often has a fractal structure, albeit a highly
fibrous one. (In fact, $L(F)$ need not be a Borel set if $F$ is Borel, though it will be
if $F$ is compact. We ignore the minor technical difficulties that this introduces.)
We have the following dimensional relationship.

Proposition 12.1

Let $L(F)$ be the line set of a Borel set $F \subset \mathbb{R}^2$. Then

(a) $\dim_H L(F) \ge \min\{2, 1 + \dim_H F\}$, and

(b) if $F$ is a 1-set then $\mathrm{area}(L(F)) = 0$ if and only if $F$ is irregular.

Proof.

(a) By the Projection theorem 6.1, $\dim_H(\mathrm{proj}_\theta F) = \min\{1, \dim_H F\}$ for almost
all $\theta \in [0, \pi)$, so from (12.1) $\dim_H(L(F) \cap L_c) = \min\{1, \dim_H F\}$ for
almost all $-\infty < c < \infty$. Part (a) now follows from Corollary 7.10.

(b) Let $F$ be a 1-set. Corollary 6.5 tells us that if $F$ is irregular then
$\mathcal{L}(\mathrm{proj}_\theta F) = 0$ for almost all $\theta$, otherwise $\mathcal{L}(\mathrm{proj}_\theta F) > 0$ for almost all $\theta$.
Using (12.2) we get the dual statement that if $F$ is irregular then
$\mathcal{L}(L(F) \cap L_c) = 0$ for almost all $c$, otherwise $\mathcal{L}(L(F) \cap L_c) > 0$ for almost all $c$.
Since $\mathrm{area}(L(F)) = \int_{-\infty}^{\infty} \mathcal{L}(L(F) \cap L_c)\,dc$, part (b) follows. □

In 1917 Kakeya posed the problem of finding the plane set of least area 
inside which a unit line segment could be reversed, i.e. manoeuvred continu- 
ously without leaving the set to reach its original position but rotated through 
180°. Essentially, this problem reduces to that of finding the smallest region that 
contains a unit line segment in every direction; certainly any set in which a seg- 
ment can be reversed must have this property. By 1928 Besicovitch had found 
a surprising construction of a set of arbitrarily small area inside which a unit 
segment could be reversed. Only many years later did he realize that the method 
of duality gave a short and elegant solution to the problem. 

Proposition 12.2 

There is a plane set of zero area which contains a line in every direction. Any 
Borel set with this property must have Hausdorff dimension 2. 

Proof. Let $F$ be any irregular 1-set such that the projection of $F$ onto the $y$-axis,
$\mathrm{proj}_{\pi/2} F$, contains the interval $[0, 1]$. (The set of figure 0.4, see Examples 2.6 and
6.7, certainly meets this requirement.) Since $F$ is irregular, $L(F)$ has zero area,
by Proposition 12.1(b). However, since $[0, 1] \subset \mathrm{proj}_{\pi/2} F$, the set $L(F)$ contains
lines that cut the $x$-axis at all angles between 0 and $\pi/4$. Taking $L(F)$, together
with copies rotated through $\pi/4$, $\pi/2$ and $3\pi/4$, gives a set of area 0 containing
a line in every direction.

For the second part, suppose that $E$ contains a line in every direction. If

$$F = \{(a, b) : L(a, b) \subset E\}$$

then $\mathrm{proj}_{\pi/2} F$ is the entire $y$-axis. Projection does not increase dimension (see
(6.1)), so $\dim_H F \ge 1$. By Proposition 12.1(a) $\dim_H L(F) = 2$; since $L(F) \subset E$
it follows that $\dim_H E = 2$. □

A set in $\mathbb{R}^n$ that contains a line segment in every direction is called a Besicovitch
set. Proposition 12.2 shows that Besicovitch sets exist in $\mathbb{R}^2$, and taking a
product with $\mathbb{R}^{n-2}$ gives a Besicovitch set in $\mathbb{R}^n$. A long-standing conjecture is
that every Besicovitch set in $\mathbb{R}^n$ has dimension $n$; we have shown this if $n = 2$.

Sets of this type have important applications in functional analysis. For a
simple example, let $g : \mathbb{R}^2 \to \mathbb{R}$ and write $G(\theta, t)$ for the integral of $g$ along
the line making angle $\theta$ with the $x$-axis and perpendicular distance $t$ from the
origin. Let $F$ be a set of zero area containing a line in every direction, and let
$g(x, y) = 1$ if $(x, y)$ is a point of $F$ and $g(x, y) = 0$ otherwise. It is clear that
$G(\theta, t)$ is not continuous in $t$ for any fixed value of $\theta$. This example becomes
significant when contrasted with the 3-dimensional situation. If $g : D \to \mathbb{R}$ is a
bounded function on a bounded domain $D$ in $\mathbb{R}^3$, and $G(\theta, t)$ is the integral of
$g$ over the plane perpendicular to the unit vector $\theta$ and perpendicular distance $t$
from the origin, it may be shown that $G(\theta, t)$ must be continuous in $t$ for almost
all unit vectors $\theta$.

The Besicovitch construction may be thought of as a packing of lines in all 
directions into a set of area zero. Similar problems may be considered for pack- 
ings of other collections of curves. For example, there are sets of zero area that 
contain the circumference of a circle of every radius, see Exercises 12.1 and 
12.2. However, every set that contains some circle circumference centred at each 
point in the plane necessarily has positive area. 



12.2 Vitushkin's conjecture 

A long-standing conjecture of Vitushkin in complex potential theory was dis- 
proved using a fractal construction. For geometrical purposes we identify the 
complex plane C with the Euclidean plane [R 2 . 

Let $F$ be a compact subset of $\mathbb{C}$. We say that $F$ is a removable set if, given
any bounded open domain $V$ containing $F$ and any bounded analytic (i.e.
differentiable in the complex sense) function $f$ on the complement $V \setminus F$, then $f$ has
an analytic extension to the whole of $V$. Thus the functions that are bounded and
analytic on $V$ are essentially the same as those that are bounded and analytic on
$V \setminus F$; removing $F$ makes no difference.

The problem of providing a geometrical characterization of removable sets
dates back many years. The removability, or otherwise, of $F$ has now been
established in the following cases:

    Removable                                  Not removable

    $\dim_H F < 1$                               $\dim_H F > 1$
    $0 < \mathcal{H}^1(F) < \infty$ and $F$ irregular        $0 < \mathcal{H}^1(F) < \infty$ and $F$ not irregular

This table should remind the reader of the projection theorems of Chapter 6.
According to Theorem 6.1 and Corollary 6.5, if $\dim_H F < 1$ then the projection
$\mathrm{proj}_\theta F$ has length 0 for almost all $\theta$, but if $\dim_H F > 1$ or if $F$ is a 1-set that is
not irregular, $\mathrm{proj}_\theta F$ has positive length for almost all $\theta$. This correspondence
between removability and almost all projections having length 0, together with
a considerable amount of further evidence, makes Vitushkin's conjecture seem
very natural: $F$ is removable if and only if $\mathrm{length}(\mathrm{proj}_\theta F) = 0$ for almost all
$\theta \in [0, \pi)$.

A fractal construction shows that Vitushkin's conjecture cannot be true. Let
$V$ be an open domain in $\mathbb{C}$ and let $\phi : V \to \phi(V)$ be a conformal mapping (i.e.
an analytic bijection) on $V$ that is not linear, so that straight lines are typically
mapped onto (non-straight) curves; $V$ as the unit disc and $\phi(z) = (z + 2)^2$ would
certainly be suitable. It is possible to construct a compact subset $F$ of $V$ such
that $\mathrm{proj}_\theta F$ has zero length for almost all $\theta$ but $\mathrm{proj}_\theta \phi(F)$ has positive length for
almost all $\theta$. This may be achieved using a version of the 'iterated Venetian blind'
construction, outlined in the proof of Theorem 6.9; it may be shown that the
'slats' can be arranged so that they generally miss straight lines in $V$, but tend to
intersect the inverse images under $\phi$ of straight lines in $\phi(V)$ (see figure 12.2). It
follows that the property '$\mathrm{proj}_\theta F$ has zero length for almost all $\theta$' is not invariant
under conformal transformations, since it can hold for $F$ but not $\phi(F)$. However,
removability is conformally invariant since the function $f(z)$ is analytic on $\phi(V)$
(respectively on $\phi(V \setminus F)$) if and only if $f(\phi(z))$ is analytic on $V$ (respectively
on $V \setminus F$). Therefore, the property of having almost all projections of zero length
cannot be equivalent to removability.



Figure 12.2 $\phi$ is an analytic mapping such that $\mathrm{proj}_\theta F$ has zero length for almost all $\theta$
but $\mathrm{proj}_\theta \phi(F)$ has positive length for almost all $\theta$

One of the curious features of this particular argument is that it leaves us none
the wiser as to whether sets with almost all projections of zero length must be
removable or vice versa. All we can deduce is that both cannot be true.

Subsequently, a non-removable set with almost all projections of zero length 
was obtained using an iterated construction. 



12.3 Convex functions 

A continuous function $f : \mathbb{R}^2 \to \mathbb{R}$ is convex if

$$f(\lambda x + (1 - \lambda)y) \le \lambda f(x) + (1 - \lambda) f(y)$$

for all $x, y \in \mathbb{R}^2$ and $0 \le \lambda \le 1$. Geometrically, if $S = \{(x, f(x)) : x \in \mathbb{R}^2\}$ is
the surface in $\mathbb{R}^3$ representing the graph of $f$, then $f$ is convex if the line segment
joining any two points of $S$ lies in or above $S$.

A convex function $f$ need not be particularly smooth; there may be points
where $f$ is not differentiable. However, convexity implies that the set of such
'singular' points cannot be too big, in the sense of dimension. Note that if $f$ is
not differentiable at $x$ then the surface $S$ supports more than one tangent plane at
$(x, f(x))$. Also notice that if $P_1$ and $P_2$ are distinct tangent planes at $(x, f(x))$
then there is a continuum of tangent planes through this point, namely those
planes 'between' $P_1$ and $P_2$ that contain the line $P_1 \cap P_2$.

Theorem 12.3 

Let $f : \mathbb{R}^2 \to \mathbb{R}$ be a convex function. Then the set of points at which $f$ is not
differentiable is contained in a countable union of rectifiable curves, so in particular
has Hausdorff dimension at most 1.

Proof. Without loss of generality, we may assume that the minimum value of
$f$ is strictly positive. Let $S$ be the surface given by the graph of $f$ and let
$g : \mathbb{R}^2 \to S$ be the 'nearest point' mapping, so that if $x \in \mathbb{R}^2$ then $g(x)$ is that
point of $S$ for which the distance $|g(x) - x|$ is least, see figure 12.3. Convexity
of $f$ guarantees that this point is unique. If $x, y \in \mathbb{R}^2$ then the angles of the
(possibly skew) quadrilateral $x, g(x), g(y), y$ at $g(x)$ and $g(y)$ must both be at
least $\pi/2$; otherwise the segment $(g(x), g(y))$ will contain a point on or above
$S$ that is nearer to $x$ or $y$. It follows that $g$ is distance decreasing, i.e.

$$|g(x) - g(y)| \le |x - y| \quad (x, y \in \mathbb{R}^2). \tag{12.3}$$

If $f$ fails to be differentiable at $x$, then $S$ supports more than one tangent plane
at $(x, f(x))$. Thus $g^{-1}(x, f(x))$, the subset of the coordinate plane $\mathbb{R}^2$ mapped
to this point by $g$, is the intersection of $\mathbb{R}^2$ with the normals to the tangent planes
to $S$ at $(x, f(x))$ and so contains a straight line segment. Let $\{L_1, L_2, \ldots\}$ be the
(countable) collection of line segments in $\mathbb{R}^2$ with endpoints having rational
coordinates. If $f$ is not differentiable at $x$ then $g^{-1}(x, f(x))$ contains a segment which
must cut at least one of the $L_i$. Thus if $F = \{(x, f(x)) : f \text{ is not differentiable at } x\}$
then $\bigcup_{i=1}^{\infty} g(L_i) \supset F$. Using (12.3) it follows that $g(L_i)$ is either a point or a
rectifiable curve with $\mathcal{H}^1(g(L_i)) \le \mathrm{length}(L_i) < \infty$; see (2.9). Then $\bigcup_{i=1}^{\infty} g(L_i)$
is a countable union of rectifiable curves containing $F$, which in particular has
dimension at most 1.

Figure 12.3 The 'nearest point mapping' $g$ from $\mathbb{R}^2$ to the surface $z = f(x)$ of a convex
function is distance decreasing

Since $|x - y| \le |(x, f(x)) - (y, f(y))|$ for $x, y \in \mathbb{R}^2$, the set of points $x$ at
which $f$ is non-differentiable is also of dimension at most 1 and is contained in
a countable collection of rectifiable curves; again see (2.9). □

Hausdorff dimension has been used in various other ways to quantify the 
irregularity of surfaces. For example, a convex surface may contain line segments; 
however, the set of directions of such line segments may be shown to have 
dimension at most 1. 



12.4 Groups and rings of fractional dimension 

A subset $F$ of $\mathbb{R}$ is a subgroup of the real numbers under the operation of
addition if

(i) $0 \in F$,

(ii) $x + y \in F$ whenever $x \in F$ and $y \in F$, and

(iii) $-x \in F$ whenever $x \in F$.

The set $F$ is a subring of $\mathbb{R}$ under addition and multiplication if, also,

(iv) $xy \in F$ whenever $x \in F$ and $y \in F$.

There are many simple examples of such structures: the integers, the rationals
and the set of numbers $\{r + s\sqrt{2} : r, s \in \mathbb{Z}\}$ are all subrings (and therefore
subgroups) of $\mathbb{R}$. These examples are countable sets and therefore have Hausdorff
dimension 0. Do there exist subgroups and subrings of $\mathbb{R}$ of Hausdorff dimension
$s$ if $0 < s < 1$?

It is relatively easy to modify the earlier Example 4.7 to obtain a subgroup of
any given dimension.

Example 12.4 

Fix $0 < s < 1$. Let $n_0, n_1, n_2, \ldots$ be a rapidly increasing sequence of integers,
say with $n_{k+1} \ge \max\{n_k^k, 4n_k^{1/s}\}$ for each $k$. For $r = 1, 2, \ldots$ let

$$F_r = \{x \in \mathbb{R} : |x - p/n_k| \le r n_k^{-1/s} \text{ for some integer } p, \text{ for all } k\}$$

and let $F = \bigcup_{r=1}^{\infty} F_r$. Then $\dim_H F = s$, and $F$ is a subgroup of $\mathbb{R}$ under addition.

Calculation. $F_r$ is essentially the set of Example 4.7, so $\dim_H F_r = s$ for all $r$
(it is easy to see that the value of $r$ does not affect the dimension). Taking a
countable union, $\dim_H F = s$.

Clearly $0 \in F_1 \subset F$. If $x, y \in F$ then $x, y \in F_r$ for some $r$, noting that
$F_{r'} \subset F_r$ if $r' \le r$. Thus, for each $k$, there are integers $p, q$ such that

$$|x - p/n_k| \le r n_k^{-1/s}, \qquad |y - q/n_k| \le r n_k^{-1/s}.$$

Adding,

$$|x + y - (p + q)/n_k| \le 2r n_k^{-1/s}$$

so $x + y \in F_{2r} \subset F$. Clearly, if $x \in F_r$ then $-x \in F_r$, so $F$ satisfies conditions
(i)-(iii) above. □

Subrings are rather harder to analyse. One geometrical approach depends on
estimating the dimension of the set of distances realized by a plane set. If $E$ is
a subset of $\mathbb{R}^2$, we define the distance set of $E$ by

$$D(E) = \{|x - y| : x, y \in E\} \subset \mathbb{R}.$$

Theorem 12.5

Let $E \subset \mathbb{R}^2$ be a Borel set. Then

$$\dim_H D(E) \ge \min\{1, \dim_H E - \tfrac{1}{2}\}. \tag{12.4}$$

Note on proof. The potential theoretic proof of this theorem is a little complicated.
Fourier transforms and the convolution theorem are used to examine the circles
with centres in $E$ that intersect $E$. It is unlikely that (12.4) is the best inequality
possible. □

Assuming this theorem, it is not difficult to obtain a partial solution to the 
subrings problem. 

Theorem 12.6 

Let $F$ be a subring of $\mathbb{R}$ under addition and multiplication. If $F$ is a Borel set
then either $\dim_H F = 0$ or $F = \mathbb{R}$.

Partial proof. We use a geometrical method to show there are no subrings of
dimension between $\frac{1}{2}$ and 1. Using $(x, y)$ coordinates in $\mathbb{R}^2$, if $(x_1, y_1), (x_2, y_2) \in
F \times F \subset \mathbb{R}^2$, then $|(x_1, y_1) - (x_2, y_2)|^2 = (x_1 - x_2)^2 + (y_1 - y_2)^2 \in F$, since $F$
is a subring. Thus, if $D^2(F \times F)$ denotes the set of squares of distances between
points of $F \times F$, we have $D^2(F \times F) \subset F$. Since the mapping $t \to t^2$ preserves
Hausdorff dimension (see Exercise 2.6) we have

$$\dim_H F \ge \dim_H D^2(F \times F) = \dim_H D(F \times F)$$
$$\ge \min\{1, \dim_H (F \times F) - \tfrac{1}{2}\}$$
$$\ge \min\{1, 2\dim_H F - \tfrac{1}{2}\}$$

using Theorem 12.5 and Product formula 7.2. This inequality is satisfied if and
only if $\dim_H F = 1$ or $\dim_H F \le \frac{1}{2}$. □
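The final step, that $d \ge \min\{1, 2d - \frac{1}{2}\}$ forces $d = 1$ or $d \le \frac{1}{2}$ for $0 \le d \le 1$, can be confirmed by a quick exact check over a grid of rational values. The short Python sketch below uses hypothetical names and is only a sanity check of the elementary inequality, not part of the proof.

```python
from fractions import Fraction

def satisfies_inequality(d):
    """True if d >= min{1, 2d - 1/2}, evaluated in exact rational arithmetic."""
    return d >= min(Fraction(1), 2 * d - Fraction(1, 2))

# Candidate dimensions 0, 1/100, 2/100, ..., 1.
grid = [Fraction(j, 100) for j in range(101)]
allowed = [d for d in grid if satisfies_inequality(d)]

# Values strictly between 1/2 and 1 are excluded; everything else on the grid survives.
excluded = [d for d in grid if not satisfies_inequality(d)]
```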



12.5 Notes and references 

More detailed accounts of the Kakeya problem and its variants are given in Besi- 
covitch (1963), Cunningham (1974) and Falconer (1985a) and the dual approach 
was introduced by Besicovitch (1964). For recent surveys see Wolff (1999) and 
Katz and Tao (2000). The book by Stein (1993) includes applications of Kakeya 
sets to functional and harmonic analysis. 

For discussions on Vitushkin's conjecture, see Mattila (1995), Murai (1988) 
and David (1999). The construction outlined is due to Mattila (1986) and the 
non-removable set with projections of lengths zero is due to Jones and Murai 
(1988). David (1999) showed that irregular 1-sets are removable. 

For general information on convex geometry see Gruber and Wills (1993). The 
result given here is due to Anderson and Klee (1952). For other results involving 
Hausdorff dimension and convexity, see Dalla and Larman (1980) and the survey 
of Schneider (1993). 
Examples of groups of fractional dimension were given by Erdős and Volkmann
(1966) who raised the question of dimensions of rings. Falconer (1985c)
used Fourier transform methods to show that Borel rings could not have dimension
between $\frac{1}{2}$ and 1, see also Mattila (1995). The complete proof of
Theorem 12.6 is due to Edgar and Miller (2003).



Exercises 

12.1 Construct a plane set of zero area that contains a line at every perpendicular
distance from the origin between 0 and 1. (Hint: consider the image of the set $F$ in
Proposition 12.2 under the transformation $(a, b) \to (a(1 + b^2)^{1/2}, b)$.)

12.2 By transforming the set obtained in the previous exercise by the mapping given in
polar coordinates by $(r, \theta) \to (1/r, \theta)$, show that there exists a plane set of zero
area that contains a circle of radius $r$ for all $r > 0$.

12.3 Show that there is a subset of the plane of area 0 that contains a different straight
line through every point on the $x$-axis.

12.4 Let $A$ be a (Borel) subset of $[0, \pi)$. Let $F$ be a subset of the plane that contains a
line running in direction $\theta$ for every $\theta \in A$. Show that $\dim_H F \ge 1 + \dim_H A$.

12.5 Dualize Theorem 6.9 to show that any Borel set of finite area $a$ may be completely
covered by a collection of straight lines of total area $a$.

12.6 Show that if a compact subset $F$ of $\mathbb{C}$ supports a mass distribution $\mu$ such that
$f(z) = \int_F (z - w)^{-1}\,d\mu(w)$ is bounded then $F$ is not removable in the sense of
Section 12.2. Show that this is the case if $1 < \dim_H F \le 2$. (Hint: see the proof of
Theorem 4.13(a).)

12.7 Show that every finite set is removable. Show that the unit circle is not removable.

12.8 Let $f : \mathbb{R} \to \mathbb{R}$ be a convex function. Show that the set of points at which $f$ is
not differentiable is finite or countable.

12.9 Find a convex $f : \mathbb{R}^2 \to \mathbb{R}$ such that the set of points of non-differentiability has
Hausdorff dimension 1. Find a (non-convex) $f : \mathbb{R}^2 \to \mathbb{R}$ such that the set of points
of non-differentiability has Hausdorff dimension 2.

12.10 Show that any subgroup of $\mathbb{R}$ under addition has box dimension 0 or 1.

12.11 Show that, for all $0 < s < 2$, there is a subgroup of $\mathbb{R}^2$ under vector addition with
Hausdorff dimension $s$. (Hint: consider a product of subgroups of $\mathbb{R}$.)



Chapter 13 Dynamical systems 



Dynamical systems continue to be intensively studied, both from theoretical and 
applied viewpoints. The availability of powerful computers has allowed numeri- 
cal investigation to complement theoretical analysis. Moreover, 'topological' 
methods for studying the qualitative behaviour of systems augment the more 
traditional quantitative approach. The subject is stimulated by an increasingly 
diverse range of applications — dynamical systems are now used to model phe- 
nomena in biology, geography, economics and internet traffic as well as in the 
traditional disciplines of engineering and physics. Volumes have been written 
on dynamical systems and chaos. We make no attempt to provide a general 
account, which would require excursions into ergodic theory, bifurcation theory 
and many other areas, but we illustrate various ways in which fractals can occur 
in dynamical systems. 

Let $D$ be a subset of $\mathbb{R}^n$ (often $\mathbb{R}^n$ itself), and let $f : D \to D$ be a continuous
mapping. As usual, $f^k$ denotes the $k$th iterate of $f$, so that $f^0(x) = x$, $f^1(x) = f(x)$,
$f^2(x) = f(f(x))$, etc. Clearly $f^k(x)$ is in $D$ for all $k$ if $x$ is a point
of $D$. Typically, $x, f(x), f^2(x), \ldots$ are the values of some quantity at times
$0, 1, 2, \ldots$. Thus the value at time $k + 1$ is given in terms of the value at time $k$
by the function $f$. For example, $f^k(x)$ might represent the size after $k$ years of
a biological population or the value of an investment subject to certain interest
and tax conditions.

An iterative scheme $\{f^k\}$ is called a discrete dynamical system. We are interested
in the behaviour of the sequence of iterates, or orbits, $\{f^k(x)\}_{k=1}^{\infty}$ for various
initial points $x \in D$, particularly for large $k$. For example, if $f(x) = \cos x$, the
sequence $f^k(x)$ converges to $0.739\ldots$ as $k \to \infty$ for any initial $x$: try repeatedly
pressing the cosine button on a calculator and see! Sometimes the distribution
of iterates appears almost random. Alternatively, $f^k(x)$ may converge to a fixed
point $w$, i.e. a point of $D$ with $f(w) = w$. More generally, $f^k(x)$ may converge
to an orbit of period-$p$ points $\{w, f(w), \ldots, f^{p-1}(w)\}$, where $p$ is the least positive
integer with $f^p(w) = w$, in the sense that $|f^k(x) - f^k(w)| \to 0$ as $k \to \infty$.
Sometimes, however, $f^k(x)$ may appear to move about at random, but always
remaining close to a certain set, which may be a fractal. In this chapter we
examine several ways in which such 'fractal attractors' or 'strange attractors'
can occur.
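The cosine example can be tried without a calculator. The following is a minimal sketch; the helper names `iterate` and `orbit` are illustrative choices, not notation from the text.

```python
import math

def iterate(f, x, k):
    """Return the k-th iterate f^k(x), with f^0(x) = x."""
    for _ in range(k):
        x = f(x)
    return x

def orbit(f, x, n):
    """Return the orbit segment [f^1(x), ..., f^n(x)]."""
    points = []
    for _ in range(n):
        x = f(x)
        points.append(x)
    return points

# Iterate cos from several starting points; each orbit settles on the
# fixed point w with cos(w) = w.
limits = [iterate(math.cos, x0, 200) for x0 in (-3.0, 0.0, 1.0, 10.0)]
w = limits[0]
print(limits)
print(abs(math.cos(w) - w))  # residual of the fixed-point equation f(w) = w
```

The first few entries of `orbit(math.cos, x0, n)` show how quickly the iterates close in on the limit from a given start.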



Fractal Geometry: Mathematical Foundations and Application. Second Edition Kenneth Falconer 
© 2003 John Wiley & Sons, Ltd ISBNs: 0-470-84861-8 (HB); 0-470-84862-6 (PB) 


Roughly speaking, an attractor is a set to which all nearby orbits converge.
However, as frequently happens in dynamical systems theory, the precise definition
varies between authors. We shall call a subset $F$ of $D$ an attractor for
$f$ if $F$ is a closed set that is invariant under $f$ (i.e. with $f(F) = F$) such that
the distance from $f^k(x)$ to $F$ converges to zero as $k$ tends to infinity for all $x$
in an open set $V$ containing $F$. The largest such open set $V$ is called the basin
of attraction of $F$. It is usual to require that $F$ is minimal in the sense that it
has no proper subset satisfying these conditions. Similarly, a closed invariant set
$F$ from which all nearby points not in $F$ are iterated away is called a repeller;
this is roughly equivalent to $F$ being an 'attractor' for the (perhaps multivalued)
inverse $f^{-1}$. An attractor or repeller may just be a single point or a period-$p$
orbit. However, even relatively simple maps $f$ can have fractal attractors.

Note that $f(D) \subset D$ so that $f^k(D) \subset f^{k-1}(D) \subset \cdots \subset f(D) \subset D$, so
$\bigcap_{i=1}^{k} f^i(D) = f^k(D)$. Thus the set $F = \bigcap_{k=1}^{\infty} f^k(D)$ is invariant under $f$. Since
$f^k(x) \in \bigcap_{i=1}^{k} f^i(D)$ for all $x \in D$, the iterates $f^k(x)$ approach $F$ as $k \to \infty$,
and $F$ is often an attractor of $f$.

Very often, if $f$ has a fractal attractor or repeller $F$, then $f$ exhibits 'chaotic'
behaviour on $F$. There are various definitions of chaos; $f$ would certainly be
regarded as chaotic on $F$ if the following are all true:

(i) The orbit $\{f^k(x)\}$ is dense in $F$ for some $x \in F$.
(ii) The periodic points of $f$ in $F$ (points for which $f^p(x) = x$ for some
positive integer $p$) are dense in $F$.
(iii) $f$ has sensitive dependence on initial conditions; that is, there is a number
$\delta > 0$ such that for every $x$ in $F$ there are points $y$ in $F$ arbitrarily close to
$x$ such that $|f^k(x) - f^k(y)| \geq \delta$ for some $k$. Thus points that are initially
close to each other do not remain close under iterates of $f$.

Condition (i) implies that $F$ cannot be decomposed into smaller closed invariant
sets, (ii) suggests a skeleton of regularity in the structure of $F$, and (iii)
reflects the unpredictability of iterates of points on $F$. In particular, (iii) implies
that accurate long-term numerical approximation to orbits of $f$ is impossible,
since a tiny numerical error will be magnified under iteration. Often conditions
that give rise to fractal attractors also lead to chaotic behaviour.

Dynamical systems are naturally suited to computer investigation. Roughly
speaking, attractors are the sets that are seen when orbits are plotted on a computer.
For some initial point $x$ one plots, say, 10,000 iterates $f^k(x)$, starting at
$k = 101$, say, to ensure that they are indistinguishable from any attractor. If an
attractor appears fractal, a 'box-counting' method can be used to estimate its
dimension. However, computer pictures can be misleading, in that the distribution
of $f^k(x)$ across an attractor can be very uneven, with certain parts of the
attractor visited very rarely.
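The plotting recipe just described (discard the first 100 iterates, keep the next 10,000) might be sketched as follows. The function names `attractor_sample` and `visit_counts` are hypothetical, and the histogram is only a crude way of seeing how unevenly an orbit visits its attractor; the cosine map is used here simply because its attractor (a single point) is already known.

```python
import math

def attractor_sample(f, x0, n_keep=10_000, n_skip=100):
    """Iterate f from x0, drop the first n_skip iterates, keep the next n_keep."""
    x = x0
    for _ in range(n_skip):      # transient: k = 1, ..., n_skip
        x = f(x)
    sample = []
    for _ in range(n_keep):      # retained: k = n_skip + 1, ..., n_skip + n_keep
        x = f(x)
        sample.append(x)
    return sample

def visit_counts(sample, lo, hi, bins):
    """Count how many sample points fall in each of `bins` equal cells of [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for x in sample:
        if lo <= x <= hi:
            i = min(int((x - lo) / width), bins - 1)
            counts[i] += 1
    return counts

sample = attractor_sample(math.cos, 1.0)
counts = visit_counts(sample, 0.0, 1.0, 10)
print(counts)  # every retained iterate lies in the cell containing 0.739...
```

For a map with a fractal attractor the same two functions apply unchanged; very unequal entries in `counts` are the unevenness the text warns about.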

13.1 Repellers and iterated function systems 

Under certain circumstances, a repeller in a dynamical system coincides with the
attractor of a related iterated function system. This is best seen by an example.
The mapping $f : \mathbb{R} \to \mathbb{R}$ given by

\[ f(x) = \tfrac{3}{2}\bigl(1 - |2x - 1|\bigr) \]

is called the tent map because of the form of its graph; see figure 13.1. Clearly,
$f$ maps $\mathbb{R}$ in a two-to-one manner onto $(-\infty, \tfrac{3}{2})$. Defining an iterated function
system $S_1, S_2 : [0, 1] \to [0, 1]$ by the contractions

\[ S_1(x) = \tfrac{1}{3}x, \qquad S_2(x) = 1 - \tfrac{1}{3}x \]

we see that

\[ f(S_1(x)) = f(S_2(x)) = x \qquad (0 \leq x \leq 1). \]

Thus $S_1$ and $S_2$ are the two branches of $f^{-1}$. Theorem 9.1 implies that there is
a unique non-empty compact attractor $F \subset [0, 1]$ satisfying

\[ F = S_1(F) \cup S_2(F) \tag{13.1} \]

which is given by $F = \bigcap_{k=0}^{\infty} S^k([0, 1])$ (writing $S(E) = S_1(E) \cup S_2(E)$ for any
set $E$). Clearly the attractor $F$ is the middle third Cantor set, with Hausdorff and
box dimensions $\log 2/\log 3$.

Figure 13.1 The tent map $f(x) = \tfrac{3}{2}(1 - |2x - 1|)$. Notice that the middle third Cantor set $F$ is mapped onto
itself by $f$ and is an invariant repeller. Notice, also, the chaotic nature of $f$ on $F$: the
iterates of a point are indicated by the arrows

It follows from (13.1) that $f(F) = F$. To see that $F$ is a repeller, observe
that if $x < 0$ then $f(x) = 3x$, so $f^k(x) = 3^k x \to -\infty$ as $k \to \infty$. If $x > 1$
then $f(x) < 0$ and again $f^k(x) \to -\infty$. If $x \in [0, 1] \setminus F$ then for some $k$, we
have $x \notin S^k[0, 1] = \bigcup \{S_{i_1} \circ \cdots \circ S_{i_k}[0, 1] : i_j = 1, 2\}$ so $f^k(x) \notin [0, 1]$, and again
$f^k(x) \to -\infty$ as $k \to \infty$. All points outside $F$ are iterated to $-\infty$ so $F$ is
a repeller.
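The escape of points outside the Cantor set can be checked with exact rational arithmetic, which avoids the rounding errors that sensitive dependence would otherwise magnify. This is a small sketch; `escape_time` is a hypothetical helper, and "no escape seen" within `max_iter` steps is only evidence, not proof, that a point lies in $F$.

```python
from fractions import Fraction

def tent(x):
    """Tent map f(x) = (3/2)(1 - |2x - 1|)."""
    return Fraction(3, 2) * (1 - abs(2 * x - 1))

def escape_time(x, max_iter=50):
    """Smallest k with f^k(x) outside [0, 1], or None if no escape is seen."""
    for k in range(max_iter + 1):
        if x < 0 or x > 1:
            return k
        x = tent(x)
    return None

# 1/4 and 1/3 lie in the middle third Cantor set and never leave [0, 1];
# 1/2 and 1/5 lie in removed intervals and escape after 1 and 2 steps.
for x0 in (Fraction(1, 4), Fraction(1, 3), Fraction(1, 2), Fraction(1, 5)):
    print(x0, escape_time(x0))

# Once outside [0, 1] the iterates run off to minus infinity: 3/2 -> -3/2 -> -9/2.
print(tent(Fraction(3, 2)), tent(Fraction(-3, 2)))
```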

With the notation of Section 9.1, the chaotic nature of $f$ on $F$ is readily
apparent. Denoting the points of $F$ by $x_{i_1,i_2,\ldots}$ with $i_j = 1, 2$, as in (9.7),
$|x_{i_1,i_2,\ldots} - x_{i'_1,i'_2,\ldots}| \leq 3^{-k}$ if $i_1 = i'_1, \ldots, i_k = i'_k$. Since
$x_{i_1,i_2,\ldots} = S_{i_1}(x_{i_2,i_3,\ldots})$, it
follows that $f(x_{i_1,i_2,\ldots}) = x_{i_2,i_3,\ldots}$. Suppose that $(i_1, i_2, \ldots)$ is an infinite sequence
with every finite sequence of 1s and 2s appearing as a consecutive block of terms;
for example,

\[ (1, 2, \;\; 1,1, \; 1,2, \; 2,1, \; 2,2, \;\; 1,1,1, \; 1,1,2, \; \ldots) \]

where the spacing is just to indicate the form of the sequence. For each point
$x_{i'_1,i'_2,\ldots}$ in $F$ and each integer $q$, we may find $k$ such that $(i'_1, i'_2, \ldots, i'_q) =
(i_{k+1}, \ldots, i_{k+q})$. Then $|x_{i_{k+1},i_{k+2},\ldots} - x_{i'_1,i'_2,\ldots}| \leq 3^{-q}$, so the iterates
$f^k(x_{i_1,i_2,\ldots}) = x_{i_{k+1},i_{k+2},\ldots}$ come arbitrarily close to each point of $F$ for suitable large $k$, so
that $f$ has dense orbits in $F$. Similarly, since $x_{i_1,\ldots,i_k,i_1,\ldots,i_k,i_1,\ldots}$ is a periodic
point of period $k$, the periodic points of $f$ are dense in $F$. The iterates have
sensitive dependence on initial conditions, since $f^k(x_{i_1,\ldots,i_k,1,\ldots}) \in [0, \tfrac{1}{3}]$ but
$f^k(x_{i_1,\ldots,i_k,2,\ldots}) \in [\tfrac{2}{3}, 1]$. We conclude that $F$ is a chaotic repeller for $f$. (The
study of $f$ by its effect on points of $F$ represented by sequences $(i_1, i_2, \ldots)$ is
known as symbolic dynamics.)
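The coding of points of $F$ by sequences can be illustrated with finite code words. In the sketch below the function `point` and its `tail` argument are hypothetical conveniences: a finite word $(i_1, \ldots, i_k)$ is completed by a chosen point `tail` of $F$, so that `point` returns $S_{i_1} \circ \cdots \circ S_{i_k}(\text{tail})$. Exact fractions are used so that the identities hold without rounding.

```python
from fractions import Fraction

def S1(x):
    return x / 3

def S2(x):
    return 1 - x / 3

def tent(x):
    """Tent map f(x) = (3/2)(1 - |2x - 1|)."""
    return Fraction(3, 2) * (1 - abs(2 * x - 1))

def point(code, tail=Fraction(0)):
    """S_{i1}(S_{i2}(...S_{ik}(tail)...)) for a finite code of 1s and 2s."""
    x = tail
    for i in reversed(code):
        x = S1(x) if i == 1 else S2(x)
    return x

code = (1, 2, 1, 1, 1, 2, 2, 1, 2, 2)

# f acts as a shift: applying f deletes the first symbol of the code.
print(tent(point(code)) == point(code[1:]))

# Points sharing their first k symbols are within 3^-k of each other.
print(abs(point(code[:4], Fraction(0)) - point(code[:4], Fraction(1))))

# The repeating code (1, 2, 1, 2, ...) gives the period-2 point 3/10.
print(point((1, 2), Fraction(3, 10)), tent(tent(Fraction(3, 10))))
```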

In exactly the same way, the attractors of general iterated function systems
correspond to repellers of functions. If $S_1, \ldots, S_m$ is a set of bijective contractions
on a domain $D$ with attractor $F$ such that $S_1(F), \ldots, S_m(F)$ are disjoint, then $F$
is a repeller for any mapping $f$ such that $f(x) = S_i^{-1}(x)$ when $x$ is near $S_i(F)$.
Again, by examining the effect of $f$ on the point $x_{i_1,i_2,\ldots}$, it may be shown that
$f$ acts chaotically on $F$. Indeed, for many dynamical systems $f$, it is possible
to decompose the domain $D$ into parts (called a Markov partition) such that the
branches of $f^{-1}$ on each part look rather like an iterated function system. See
Theorem 14.15 for an example in the complex plane.



13.2 The logistic map 

The logistic map $f_\lambda : \mathbb{R} \to \mathbb{R}$ is given by

\[ f_\lambda(x) = \lambda x(1 - x) \tag{13.2} \]

where $\lambda$ is a positive constant. This mapping was introduced to model the population
development of certain species: if the (normalized) population is $x$ at the
end of a given year, it is assumed to be $f_\lambda(x)$ at the end of the following year.
Nowadays the logistic map is studied intensively as an archetypal 1-dimensional
dynamical system. We content ourselves here with an analysis when $\lambda$ is large,
and a brief discussion when $\lambda$ is small.

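For $\lambda > 2 + \sqrt{5} = 4.236\ldots$ we get a nonlinear version of the tent map
repeller of Section 13.1. Write $a = \tfrac{1}{2} - \sqrt{\tfrac{1}{4} - 1/\lambda}$ and $1 - a = \tfrac{1}{2} + \sqrt{\tfrac{1}{4} - 1/\lambda}$
for the roots of $f_\lambda(x) = 1$. Each of the intervals $[0, a]$ and $[1 - a, 1]$ is mapped
bijectively onto $[0, 1]$ by $f_\lambda$. The mappings $S_1 : [0, 1] \to [0, a]$ and $S_2 : [0, 1] \to
[1 - a, 1]$ given by

\[ S_1(x) = \tfrac{1}{2} - \sqrt{\tfrac{1}{4} - x/\lambda}, \qquad S_2(x) = \tfrac{1}{2} + \sqrt{\tfrac{1}{4} - x/\lambda} \]

are the values of the inverse $f_\lambda^{-1}$ in $[0, a]$ and $[1 - a, 1]$, with $f_\lambda(S_1(x)) =
f_\lambda(S_2(x)) = x$ for each $x$ in $[0, 1]$. For $i = 1, 2$ we have

\[ |S_i'(x)| = \frac{1}{2\lambda}\left(\tfrac{1}{4} - x/\lambda\right)^{-1/2} \]

so

\[ \frac{1}{\lambda} \leq |S_i'(x)| \leq \frac{1}{2\lambda}\left(\tfrac{1}{4} - 1/\lambda\right)^{-1/2} = \tfrac{1}{2}\left(\lambda^2/4 - \lambda\right)^{-1/2} \]

if $0 \leq x \leq 1$. By the mean-value theorem

\[ \frac{1}{\lambda}|x - y| \leq |S_i(x) - S_i(y)| \leq \tfrac{1}{2}\left(\lambda^2/4 - \lambda\right)^{-1/2}|x - y| \qquad (0 \leq x, y \leq 1). \tag{13.3} \]

Thus, if $\lambda > 2 + \sqrt{5}$ the mappings $S_1$ and $S_2$ are contractions on $[0, 1]$, so by Theorem
9.1 the iterated function system $\{S_1, S_2\}$ has a unique (non-empty compact)
attractor $F \subset [0, 1]$ satisfying

\[ F = S_1(F) \cup S_2(F), \]

and it follows that $f_\lambda(F) = F$. Since this union is disjoint, $F$ is totally disconnected.
In exactly the same way as for the tent map, $F$ is a repeller, with
$f_\lambda^k(x) \to -\infty$ if $x \notin F$, and $f_\lambda$ is chaotic on $F$.

To estimate the dimension of $F$ we proceed as in Example 9.8. Using Propositions
9.6 and 9.7 it follows from (13.3) that

\[ \frac{\log 2}{\log \lambda} \leq \dim_H F \leq \underline{\dim}_B F \leq \overline{\dim}_B F \leq \frac{\log 2}{\log\bigl(\lambda(1 - 4/\lambda)^{1/2}\bigr)}. \]

Thus, if $\lambda$ is large, the dimension of $F$ is close to $\log 2/\log \lambda$.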
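To see how quickly the two bounds close up as $\lambda$ grows, they can simply be evaluated. The sketch below assumes only the two contraction ratios in (13.3), namely $1/\lambda$ and $\tfrac{1}{2}(\lambda^2/4 - \lambda)^{-1/2} = (\lambda(1 - 4/\lambda)^{1/2})^{-1}$; the function name is illustrative.

```python
import math

def repeller_dimension_bounds(lam):
    """Lower and upper bounds for dim F from (13.3); requires lam > 2 + sqrt(5)."""
    if lam <= 2 + math.sqrt(5):
        raise ValueError("need lam > 2 + sqrt(5) for S1, S2 to be contractions")
    lower = math.log(2) / math.log(lam)
    upper = math.log(2) / math.log(lam * math.sqrt(1 - 4 / lam))
    return lower, upper

for lam in (4.5, 5.0, 10.0, 100.0):
    lo, hi = repeller_dimension_bounds(lam)
    print(f"lambda = {lam:6.1f}: {lo:.4f} <= dim F <= {hi:.4f}")
```

Close to the threshold $2 + \sqrt{5}$ the upper bound exceeds 1 and so carries no information for a subset of the line; for large $\lambda$ the two bounds nearly coincide.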
For smaller values of $\lambda$, the dynamics of the logistic map (13.2) are subtle. If
$0 < \lambda \leq 4$, the function $f_\lambda$ maps $[0, 1]$ into itself, and we can restrict attention
to the interval $[0, 1]$. If $x$ is a period-$p$ point of $f$, i.e. $f^p(x) = x$ and $p$ is
the least positive integer with this property, we say that $x$ is stable or unstable
according to whether $|(f^p)'(x)| < 1$ or $> 1$. Stable periodic points attract nearby
orbits, unstable periodic points repel them. If $0 < \lambda \leq 1$, then $f_\lambda$ has a fixed
point at 0 which is attractive, in the sense that $f_\lambda^k(x) \to 0$ for all $x \in [0, 1]$. For
$1 < \lambda < 3$, the function $f_\lambda$ has an unstable fixed point 0, and a stable fixed point
$1 - 1/\lambda$, so $f_\lambda^k(x) \to 1 - 1/\lambda$ for all $x \in (0, 1)$. As $\lambda$ increases through the value
$\lambda_1 = 3$, the fixed point at $1 - 1/\lambda$ becomes unstable, splitting into a stable orbit
of period 2 to which all but countably many points of $(0, 1)$ are attracted (see
figure 13.2). When $\lambda$ reaches $\lambda_2 = 1 + \sqrt{6}$, the period-2 orbit becomes unstable
and is replaced by a stable period-4 orbit. As $\lambda$ is increased further, this period
doubling continues with a stable orbit of period $2^q$ appearing at $\lambda = \lambda_q$; this orbit
attracts all but countably many initial points in $(0, 1)$.
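The first few period doublings are easy to observe numerically. The following sketch iterates the logistic map from $x = \tfrac12$, discards a transient, and looks for the smallest $p$ at which the orbit returns to itself; the names `attracting_cycle` and `multiplier`, the tolerance and the transient length are all assumptions of this illustration, and the detection is unreliable at or very near the bifurcation values themselves, where convergence is slow.

```python
def logistic(lam, x):
    """Logistic map f_lam(x) = lam * x * (1 - x)."""
    return lam * x * (1 - x)

def attracting_cycle(lam, x0=0.5, n_skip=10_000, max_period=64, tol=1e-9):
    """Return (p, x): the apparent period of the orbit x0 settles on and a
    point x of that orbit; (None, x) if no period <= max_period is seen."""
    x = x0
    for _ in range(n_skip):          # discard the transient
        x = logistic(lam, x)
    y = x
    for p in range(1, max_period + 1):
        y = logistic(lam, y)
        if abs(y - x) < tol:         # orbit has returned to x after p steps
            return p, x
    return None, x

def multiplier(lam, x, p):
    """(f^p)'(x), computed by the chain rule along the orbit of x."""
    m = 1.0
    for _ in range(p):
        m *= lam * (1 - 2 * x)
        x = logistic(lam, x)
    return m

for lam in (0.8, 2.5, 3.2, 3.5):
    p, x = attracting_cycle(lam)
    print(lam, p, x, multiplier(lam, x, p))
```

A multiplier of modulus less than 1 confirms that the orbit found is stable in the sense defined above.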

One of the surprising features of this process is that the period doubling occurs
more and more frequently as $\lambda$ increases, and $q \to \infty$ as $\lambda \to \lambda_\infty$ where $\lambda_\infty \approx
3.570$. As $\lambda$ approaches $\lambda_\infty$, the repeated splitting of stable orbits of period $2^q$
into nearby stable orbits of period $2^{q+1}$ provides a sequence of attracting orbits
which approximate to a Cantor set; see figure 13.3.

When $\lambda = \lambda_\infty$ the attractor $F$ actually is a set of Cantor type. Then $F$ is invariant
under $f_{\lambda_\infty}$ with all except a countable number of points of $[0, 1]$ approaching
$F$ under iteration by $f_{\lambda_\infty}$ (the exceptional points are those that iterate onto the
unstable periodic orbits). The effect of $f_{\lambda_\infty}$ on $F$ can be analysed by extrapolating
from the periodic orbits of $f_{\lambda_q}$ when $q$ is large. There are dense orbits but
no sensitive dependence on initial conditions. It is possible to show that $F$ is the
attractor in the sense of (9.2) of a certain iterated function system, and, using the
method of Example 9.8, the Hausdorff dimension may be estimated as $0.538\ldots$.
A complete analysis of the structure of this fractal attractor is beyond the scope
of this book.

Figure 13.2 The logistic map $f_\lambda(x) = \lambda x(1 - x)$ for $\lambda = 3.38$. Note the period-2 orbit
$x_1, x_2$ with $f_\lambda(x_1) = x_2$ and $f_\lambda(x_2) = x_1$

Figure 13.3 For each $\lambda$ the iterates $f_\lambda^k(x)$ are plotted for $k$ between 150 and 300, for
a suitable initial $x$. The intersection of the plot with vertical lines shows the periodic
attractors for $\lambda < \lambda_\infty$. As $\lambda$ approaches $\lambda_\infty$, repeated splitting of the periodic orbits results
in an attractor of Cantor set form at $\lambda = \lambda_\infty$

For $\lambda_\infty < \lambda < 4$ several types of behaviour occur. There is a set of parameters
$K$ such that if $\lambda \in K$ then $f_\lambda$ has a truly chaotic attractor of positive length.
Moreover, $K$ itself has positive Lebesgue measure. However, in the gaps or
'windows' of $K$, period doubling again occurs. For example, when $\lambda \simeq 3.83$
there is a stable period-3 orbit; as $\lambda$ increases it splits first into a stable period-6
orbit, then into a stable period-12 orbit, and so on. When $\lambda$ reaches about 3.855
the 'limit' of these stable orbits gives a Cantor-like attractor. Similarly there are
other windows where period doubling commences with a 5-cycle, a 7-cycle and
so on.

One of the most fascinating features of this period doubling is its universality:
the behaviour of the logistic map as $\lambda$ increases is qualitatively the same
as that of any family of transformations of an interval $f_\lambda(x) = \lambda f(x)$, provided
that $f$ is unimodal, i.e. has a single maximum at $c$ say, with $f''(c) < 0$.
Although the values $\lambda_1, \lambda_2, \ldots$ at which period doubling occurs depend on $f$, the
rate at which these values approach $\lambda_\infty$ is universal, i.e. $\lambda_\infty - \lambda_k \simeq c\delta^{-k}$ where
$\delta = 4.6692\ldots$ is the Feigenbaum constant and $c$ depends on $f$. Moreover, the
Hausdorff dimension of the fractal attractor of $f_{\lambda_\infty}$ is $0.538\ldots$, this same value
occurring for any differentiable and unimodal $f$.
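As a rough consistency check on this rate for the logistic map, note that if $\lambda_\infty - \lambda_k \simeq c\delta^{-k}$ then the ratio of successive gaps between doubling values tends to $\delta$. The worked estimate below uses $\lambda_1 = 3$ and $\lambda_2 = 1 + \sqrt{6} \approx 3.4495$ from the discussion above, together with $\lambda_3 \approx 3.5441$, a commonly quoted numerical value that is not given in the text and is assumed here.

```latex
\[
\frac{\lambda_k - \lambda_{k-1}}{\lambda_{k+1} - \lambda_k}
  \simeq \frac{c\delta^{-(k-1)} - c\delta^{-k}}{c\delta^{-k} - c\delta^{-(k+1)}}
  = \delta,
\qquad
\frac{\lambda_2 - \lambda_1}{\lambda_3 - \lambda_2}
  \approx \frac{3.4495 - 3}{3.5441 - 3.4495}
  \approx 4.75 .
\]
```

Even this first ratio is already within about 2% of $\delta = 4.6692\ldots$; the approximation should improve for larger $k$.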

Mappings which have been used to model biological populations and which 
exhibit similar features include the following: 



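\[
\begin{aligned}
f_\lambda(x) &= \lambda \sin \pi x \\
f_\lambda(x) &= x \exp \lambda(1 - x) \\
f_\lambda(x) &= x(1 + \lambda(1 - x)) \\
f_\lambda(x) &= \lambda x/(1 + ax)^5.
\end{aligned}
\]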



13.3 Stretching and folding transformations 

One of the simplest planar dynamical systems with a fractal attractor is the
'baker's' transformation, so-called because it resembles the process of repeatedly
stretching a piece of dough and folding it in two. Let $E = [0, 1] \times [0, 1]$ be the
unit square. For fixed $0 < \lambda < \tfrac{1}{2}$ we define the baker's transformation $f : E \to E$ by

\[
f(x, y) =
\begin{cases}
(2x, \lambda y) & (0 \leq x \leq \tfrac{1}{2}) \\
(2x - 1, \lambda y + \tfrac{1}{2}) & (\tfrac{1}{2} < x \leq 1).
\end{cases}
\tag{13.4}
\]

This transformation may be thought of as stretching $E$ into a $2 \times \lambda$ rectangle,
cutting it into two $1 \times \lambda$ rectangles and placing these above each other with a gap
of $\tfrac{1}{2} - \lambda$ in between; see figure 13.4. Then $E_k = f^k(E)$ is a decreasing sequence
of sets, with $E_k$ comprising $2^k$ horizontal strips of height $\lambda^k$ separated by gaps of
at least $(\tfrac{1}{2} - \lambda)\lambda^{k-1}$. Since $f(E_k) = E_{k+1}$, the compact limit set $F = \bigcap_{k=0}^{\infty} E_k$
satisfies $f(F) = F$. (Strictly speaking, $f(F)$ does not include part of $F$ in the
left edge of the square $E$, a consequence of $f$ being discontinuous. However, this
has little effect on our study.) If $(x, y) \in E$ then $f^k(x, y) \in E_k$, so $f^k(x, y)$ lies
within distance $\lambda^k$ of $F$. Thus all points of $E$ are attracted to $F$ under iteration
by $f$.

If the initial point $(x, y)$ has $x = 0.a_1a_2\ldots$ in base 2, and $x \neq \tfrac{1}{2}, 1$ then it is
easily checked that

\[ f^k(x, y) = (0.a_{k+1}a_{k+2}\ldots, \; y_k) \]

where $y_k$ is some point in the strip of $E_k$ numbered $a_ka_{k-1}\ldots a_1$ (base 2) counting
from the bottom with the bottom strip numbered 0. Thus when $k$ is large the
position of $f^k(x, y)$ depends largely on the base-2 digits $a_i$ of $x$ with $i$ close
to $k$. By choosing an $x$ with base 2 expansion containing all finite sequences, we
can arrange for $f^k(x, y)$ to be dense in $F$ for certain initial $(x, y)$, just as in the
case of the tent map.

Figure 13.4 The baker's transformation: (a) its effect on the unit square; (b) its attractor
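The digit-shifting description can be checked directly. The sketch below assumes the form of the baker's transformation described in the text (left half mapped to the bottom strip, right half to the strip starting at height $\tfrac12$); the helpers `binary_digits` and `strip_bottom` are hypothetical names, and the starting point $x = \tfrac13 = 0.010101\ldots$ (base 2) is chosen because it is not a dyadic rational.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def baker(x, y, lam):
    """One step of the baker's transformation on the unit square."""
    if x <= HALF:
        return 2 * x, lam * y
    return 2 * x - 1, lam * y + HALF

def binary_digits(x, n):
    """First n base-2 digits a_1, ..., a_n of x in [0, 1)."""
    digits = []
    for _ in range(n):
        x *= 2
        a = int(x >= 1)
        digits.append(a)
        x -= a
    return digits

def strip_bottom(digits, lam):
    """Lower edge of the strip of E_k selected by the digits a_1, ..., a_k."""
    k = len(digits)
    return HALF * sum(a * lam ** (k - j) for j, a in enumerate(digits, start=1))

lam = Fraction(1, 3)
x, y = Fraction(1, 3), Fraction(1, 2)
k = 5
digits = binary_digits(x, k)
for _ in range(k):
    x, y = baker(x, y, lam)
bottom = strip_bottom(digits, lam)

print(digits)           # a_1, ..., a_5 of the starting x
print(x)                # 0.a_6 a_7 ... in base 2
print(bottom, y)        # y lies in the strip [bottom, bottom + lam**k]
```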

Further analysis along these lines shows that $f$ has sensitive dependence on
initial conditions, and that the periodic points of $f$ are dense in $F$, so that
$F$ is a chaotic attractor for $f$. Certainly $F$ is a fractal: it is essentially the
product $[0, 1] \times F_1$, where $F_1$ is a Cantor set that is the IFS attractor of $S_1(x) =
\lambda x$, $S_2(x) = \tfrac{1}{2} + \lambda x$. Theorem 9.3 gives $\dim_H F_1 = \dim_B F_1 = \log 2/{-\log \lambda}$, so
$\dim_H F = 1 + \log 2/{-\log \lambda}$, using Corollary 7.4.

The baker's transformation is rather artificial, being piecewise linear and dis- 
continuous. However, it does serve to illustrate how the 'stretching and cutting' 
procedure results in a fractal attractor. 

The closely related process of 'stretching and folding' can occur for continuous
functions on plane regions. Let $E = [0, 1] \times [0, 1]$ and suppose that $f$ maps $E$
in a one-to-one manner onto a horseshoe-shaped region $f(E)$ contained in $E$.
Then $f$ may be thought of as stretching $E$ into a long thin rectangle which is
then bent in the middle. This figure is repeatedly stretched and bent by $f$ so that
$f^k(E)$ consists of an increasing number of side-by-side strips; see figure 13.5. We
have $E \supset f(E) \supset f^2(E) \supset \cdots$, and the compact set $F = \bigcap_{k=1}^{\infty} f^k(E)$ attracts
all points of $E$. Locally, $F$ looks like the product of a Cantor set and an interval.






A variation on this construction gives a transformation with rather different
characteristics; see figure 13.6. If $D$ is a plane domain containing the unit square
$E$ and $f : D \to D$ is such that $f(E)$ is a horseshoe with 'ends' and 'arch' lying
in a part of $D$ outside $E$ that is never iterated back into $E$, then almost all points of
the square $E$ (in the sense of plane measure) are eventually iterated outside $E$ by
$f$. If $f^k(x, y) \in E$ for all positive $k$, then $(x, y) \in \bigcap_{k=1}^{\infty} f^{-k}(E)$. With $f$ suitably
defined, $f^{-1}(E)$ consists of two horizontal bars across $E$, so $\bigcap_{k=1}^{\infty} f^{-k}(E)$ is the
product of $[0, 1]$ and a Cantor set. The set $F = \bigcap_{k=-\infty}^{\infty} f^k(E) = \bigcap_{k=0}^{\infty} f^k(E) \cap
\bigcap_{k=1}^{\infty} f^{-k}(E)$ is compact and invariant for $f$, and is the product of two Cantor
sets. However, $F$ is not an attractor, since points arbitrarily close to $F$ are iterated
outside $E$.




Figure 13.5 A horseshoe map. (a) The square $E$ is transformed, by stretching and bending,
to the horseshoe $f(E)$, with $a, b, c, d$ mapped to $a', b', c', d'$, respectively. (b) The
iterates of $E$ under $f$ form a set that is locally a product of a line segment and a
Cantor set



Figure 13.6 An alternative horseshoe map. (a) The square $E$ is transformed so that the
'arch' and 'ends' of $f(E)$ lie outside $E$. (b) The sets $\bigcap_{k=1}^{\infty} f^{-k}(E)$ and $\bigcap_{k=0}^{\infty} f^k(E)$
are both products of a Cantor set and a unit interval. Their intersection $F$ is an unstable
invariant set for $f$



A specific example of a 'stretching and folding' transformation is the Hénon
map $f : \mathbb{R}^2 \to \mathbb{R}^2$

\[ f(x, y) = (y + 1 - ax^2, \; bx) \tag{13.5} \]

where $a$ and $b$ are constants. (The values $a = 1.4$ and $b = 0.3$ are usually chosen
for study. For these values there is a quadrilateral $D$ for which $f(D) \subset D$ to
which we can restrict attention.) This mapping has Jacobian $-b$ for all $(x, y)$,
so it contracts area at a constant rate throughout $\mathbb{R}^2$; to within a linear change
of coordinates, (13.5) is the most general quadratic mapping with this property.
The transformation (13.5) may be decomposed into an (area-preserving) bend, a
contraction, and a reflection, the net effect being 'horseshoe-like'; see figure 13.7.
This leads us to expect $f$ to have a fractal attractor, and this is borne out by
computer pictures, figure 13.8. Detailed pictures show banding indicative of a
set that is locally the product of a line segment and a Cantor-like set. Numerical
estimates suggest that the attractor has box dimension of about 1.26 when $a = 1.4$
and $b = 0.3$.

Figure 13.7 The Hénon map may be decomposed into an area-preserving bend,
$(x_1, y_1) = (x, 1 - ax^2 + y)$, followed by a contraction in the $x$ direction,
$(x_2, y_2) = (bx_1, y_1)$, followed by a reflection in the line $y = x$,
$(x_3, y_3) = (y_2, x_2)$. The diagrams show the effect
of these successive transformations on a rectangle
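A crude version of such a numerical estimate can be sketched as follows: generate a long orbit of the Hénon map with $a = 1.4$, $b = 0.3$, count the grid boxes of side $2^{-j}$ that it visits, and fit a line to $\log N$ against $\log(1/\varepsilon)$. The starting point $(0, 0)$, the orbit length, the range of box sizes and all function names are assumptions of this illustration; with so few scales the slope is only a rough indication and should not be expected to reproduce the quoted value closely.

```python
import math

def henon(x, y, a=1.4, b=0.3):
    """One step of the Henon map (13.5)."""
    return y + 1 - a * x * x, b * x

def henon_orbit(n_keep, n_skip=1000, start=(0.0, 0.0)):
    """Orbit of `start`, with the first n_skip iterates discarded."""
    x, y = start
    pts = []
    for k in range(n_skip + n_keep):
        x, y = henon(x, y)
        if k >= n_skip:
            pts.append((x, y))
    return pts

def box_count(points, eps):
    """Number of grid squares of side eps containing at least one point."""
    return len({(math.floor(x / eps), math.floor(y / eps)) for x, y in points})

def box_dimension_estimate(points, exponents=range(3, 8)):
    """Least-squares slope of log N(eps) against log(1/eps), eps = 2^-j."""
    xs = [j * math.log(2) for j in exponents]
    ys = [math.log(box_count(points, 2.0 ** -j)) for j in exponents]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    num = sum((u - mx) * (v - my) for u, v in zip(xs, ys))
    den = sum((u - mx) ** 2 for u in xs)
    return num / den

points = henon_orbit(100_000)
estimate = box_dimension_estimate(points)
print(estimate)
```

Using smaller boxes requires correspondingly longer orbits, since parts of the attractor that are rarely visited are otherwise undercounted.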

Detailed analysis of the dynamics of the Hénon map is complicated. In particular
the qualitative changes in behaviour (bifurcations) that occur as $a$ and $b$
vary are highly intricate.

Many other types of 'stretching and folding' are possible. Transformations can
fold several times or even be many-to-one; for example the ends of a horseshoe
might cross. Such transformations often have fractal attractors, but their analysis
tends to be difficult.

Figure 13.8 Iterates of a point under the Hénon map (13.5) showing the form of the
attractor. Banding is apparent in the enlarged portion in the inset



13.4 The solenoid 

Our next example is of a transformation of a 3-dimensional region, a solid torus.
If a unit disc $B$ is rotated through 360° about an axis $L$ in the plane of, but not
intersecting, $B$, a solid torus $D$ is swept out. The torus $D$ may be thought of as
the product of the circle $C$, of radius $r > 1$, obtained by rotating the centre of $B$
around $L$, and $B$. This gives a convenient parametrization of $D$ as

\[ \{(\phi, w) \in C \times B : 0 \leq \phi < 2\pi, \; |w| \leq 1\} \]

where the angle $\phi$ specifies a point on $C$, and where $w$ is a position vector
relative to the centre of $B$; see figure 13.9.
Fix $0 < a < \tfrac{1}{4}$ and define $f : D \to D$ by

\[ f(\phi, w) = \bigl(2\phi \ (\mathrm{mod}\ 2\pi), \; aw + \tfrac{1}{2}\hat{\phi}\bigr) \tag{13.6} \]

where $\hat{\phi}$ is the unit vector in $B$ at angle $\phi$ to the outwards axis. Then $f$ maps
$D$ onto a solid tube of radius $a$ that traverses $D$ twice, see figure 13.10. Note
that $(\phi, w)$ and $(\phi + \pi, w)$ are mapped to points in the same 'slice' of $D$, that is
the same half-plane bounded by $L$. The second iterate $f^2(D)$ is a tube of radius
$a^2$ going round $f(D)$ twice, so around $D$ four times; $f^3(D)$ traverses $D$ eight
times, etc. The intersection $F = \bigcap_{k=1}^{\infty} f^k(D)$ is highly fibrous: locally it looks
like a bundle of line segments that cut any cross section of $D$ in a Cantor-like
set. The set $F$, called a solenoid, is invariant under $f$, and attracts all points
of $D$.

Figure 13.9 Parametrization of the torus $D$

We may find the dimension of $F$ by routine methods. Let $P_\phi$ be the half-plane
bounded by $L$ and cutting $C$ at $\phi$. Observe that $f^k(C)$ is a smooth
closed curve traversing the torus $2^k$ times, with total length at most $2^k c$ where
$c$ is independent of $k$ ($f^k(C)$ cannot oscillate too wildly: the angle between
every curve $f^k(C)$ and every half-plane $P_\phi$ has a positive lower bound). The
set $f^k(D)$ is a 'fattening' of the curve $f^k(C)$ to a tube of radius $a^k$, so it
may be covered by a collection of balls of radius $2a^k$ spaced at intervals of
$a^k$ along $f^k(C)$. Clearly, $2 \times 2^k c a^{-k}$ balls will suffice, so applying Proposition
4.1 in the usual way we get $\dim_H F \leq \overline{\dim}_B F \leq s$ and $\mathcal{H}^s(F) < \infty$ for
$s = 1 + \log 2/{-\log a}$.

To get a lower estimate for the dimension, we examine the sections $F \cap P_\phi$ for
each $\phi$. The set $f(D) \cap P_\phi$ consists of two discs of radius $a$ situated diametrically
opposite each other with centres $\tfrac{1}{2}$ apart inside $D \cap P_\phi$. Each of these discs
contains two discs of $f^2(D) \cap P_\phi$ of radius $a^2$ and with centres $\tfrac{1}{2}a$ apart, and so
on. We may place a mass distribution $\mu$ on $F \cap P_\phi$ in such a way that each of
the $2^k$ discs of $f^k(D) \cap P_\phi$ has mass $2^{-k}$. If $U \subset P_\phi$ satisfies

\[ a^k(\tfrac{1}{2} - 2a) \leq |U| < a^{k-1}(\tfrac{1}{2} - 2a) \]

for some positive integer $k$, then $U$ intersects at most one disc of $f^k(D) \cap P_\phi$, so

\[ \mu(U) \leq 2^{-k} = a^{k(\log 2/{-\log a})} \leq c_1 |U|^{\log 2/{-\log a}} \]

where $c_1$ is independent of $|U|$. It follows from the Mass distribution principle
4.2 that

\[ \mathcal{H}^{\log 2/{-\log a}}(F \cap P_\phi) \geq c_1^{-1}. \]

Since $F$ is built up from sections $F \cap P_\phi$ $(0 \leq \phi < 2\pi)$, a higher-dimensional
modification of Proposition 7.9 implies $\mathcal{H}^s(F) > 0$, where $s = 1 + \log 2/{-\log a}$.
We conclude that $\dim_H F = \dim_B F = s$, and that $0 < \mathcal{H}^s(F) < \infty$.

Figure 13.10 The solenoid. (a) The torus $D$ and its image under $f$. (b) A plane section
through $D$ intersects $F$ in a Cantor-like set

If $\phi/2\pi = 0.a_1a_2\ldots$ to base 2, it follows from (13.6) that $f^k(\phi, w) = (\phi_k, v_k)$
where $\phi_k/2\pi = 0.a_{k+1}a_{k+2}\ldots$ and where the integer with base-2 representation
$a_ka_{k-1}\ldots a_{k-d+1}$ determines which of the $2^d$ discs of $f^d(D) \cap P_{\phi_k}$ the point $v_k$
belongs to for $d \leq k$. Just as in previous examples, suitable choice of the digits
$a_1, a_2, \ldots$ leads to initial points $(\phi, w)$ with $f^k(\phi, w)$ dense in $F$, or alternatively
to periodic orbits, so that $f$ is chaotic on $F$.
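The solenoid map (13.6) is straightforward to experiment with if the cross-sectional disc $B$ is identified with the unit disc in the complex plane, so that $\hat{\phi}$ becomes $e^{i\phi}$. That identification, the choice of the real axis as the outwards axis and the function names are assumptions of this sketch. It also evaluates the dimension formula $s = 1 + \log 2/{-\log a}$ obtained above.

```python
import cmath
import math

def solenoid(phi, w, a):
    """One step of f(phi, w) = (2 phi mod 2 pi, a w + (1/2) phi_hat)."""
    phi_hat = cmath.exp(1j * phi)   # unit vector at angle phi, as a complex number
    return (2 * phi) % (2 * math.pi), a * w + 0.5 * phi_hat

def solenoid_dimension(a):
    """Dimension s = 1 + log 2 / (-log a) of the solenoid."""
    return 1 + math.log(2) / -math.log(a)

a = 0.2
phi, w = 1.0, 0.3 + 0.4j
radii = []
for _ in range(50):
    phi, w = solenoid(phi, w, a)
    radii.append(abs(w))

print(max(radii))                 # iterates stay inside the unit disc
for a_val in (0.05, 0.1, 0.2):
    print(a_val, solenoid_dimension(a_val))
```

Since $|aw + \tfrac12\hat{\phi}| \leq a + \tfrac12$, every iterate has $|w| < 1$, so the orbit remains in $D$ as the text requires.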



13.5 Continuous dynamical systems 

A discrete dynamical system may be thought of as a formula relating the value of 
a quantity at successive discrete time intervals. If the time interval is allowed to 
tend to 0, then the formula becomes a differential equation in the usual way. Thus 
it is natural to regard an autonomous (time-independent) differential equation as 
a continuous dynamical system. 

Let $D$ be a domain in $\mathbb{R}^n$ and let $f : D \to \mathbb{R}^n$ be a smooth function. The
differential equation

\[ \dot{x} = \frac{dx}{dt} = f(x) \tag{13.7} \]

has a family of solution curves or trajectories which fill $D$. If an initial point $x(0)$
is given, the solution $x(t)$ remains on the unique trajectory that passes through
$x(0)$ for all time $t$; the behaviour of $x(t)$ as $t \to \pm\infty$ may be found by following
the trajectory. Given reasonable conditions on $f$, no two trajectories cross;
otherwise the equations (13.7) would not determine the motion of $x$. Moreover,
the trajectories vary smoothly across $D$ except at points where $\dot{x} = f(x) = 0$
and the trajectories are single points.

As in the discrete case, continuous dynamical systems give rise to attractors
and repellers. A closed subset $F$ of $D$ might be termed an attractor with basin
of attraction $V$ containing $F$ if, for all initial points $x(0)$ in the open set $V$, the
trajectory $x(t)$ through $x(0)$ approaches $F$ as $t$ tends to infinity. Of course, we
require $F$ to be invariant, so that if $x(0)$ is a point of $F$ then $x(t)$ is in $F$ for
$-\infty < t < \infty$, implying that $F$ is a union of trajectories. We also require $F$ to
be minimal, in the sense that there is some point $x(0)$ such that $x(t)$ is dense
in $F$.

When $D$ is a plane domain, the range of attractors for continuous systems
is rather limited. The only attractors possible are isolated points ($x$ for which
$f(x) = 0$ in (13.7)) or closed loops. More complicated attractors cannot occur.
To demonstrate this, suppose that $x(t)$ is a dense trajectory in an attractor and
that for $t$ near $t_2$ it runs close to, but distinct from, its path when $t$ is near $t_1$.
Assuming that the trajectories vary smoothly, the directions of $\dot{x}(t)$ at $t_1$ and
$t_2$ are almost parallel (see figure 13.11). Thus for $t > t_2$ the trajectory $x(t)$ is
'blocked' from ever getting too close to $x(t_1)$ so that $x(t_1)$ cannot in fact be
a point on an attractor. (The precise formulation of this fact is known as the
Poincaré-Bendixson theorem.)

Consequently, to find continuous dynamical systems with fractal attractors we
need to look at systems in 3 or more dimensions. Linear differential equations
(with $f(x)$ a linear function of $x$ in (13.7)) can essentially be solved completely
by classical methods, the solutions involving periodic or exponential terms. However,
even simple non-linear terms can lead to trajectories of a highly intricate
form. Non-linear differential equations, particularly in higher dimensions, are
notoriously difficult to analyse, requiring a combination of qualitative mathematical
analysis and numerical study. One standard approach is to reduce a
3-dimensional continuous system to a 2-dimensional discrete system by looking
at plane cross sections or Poincaré sections as they are called. If $P$ is a plane
region transverse to the trajectories, we may define the first return map $g : P \to P$
by taking $g(x)$ as the point at which the trajectory through $x$ next intersects $P$;
see figure 13.12. Then $g$ is a discrete dynamical system on $P$. If $g$ has an attractor
$E$ in $P$ it follows that the union of trajectories through the points of $E$ is an
attractor $F$ of $f$. Locally $F$ looks like a product of $E$ and a line segment, and
typically $\dim_H F = 1 + \dim_H E$, by a variation on Corollary 7.4.

Figure 13.11 A trajectory of a continuous dynamical system in the plane. Assuming that
the trajectories vary smoothly, the trajectory shown is 'cut off' from returning too close
to $x(t_1)$ any time after $t_2$

Perhaps the best known example of a continuous dynamical system with a fractal
attractor is the Lorenz system of equations. Lorenz studied thermal convection
of a horizontal layer of fluid heated from below: the warm fluid may rise owing to
its buoyancy and circulate in cylindrical rolls. Under certain conditions these cells
are a series of parallel rotating cylindrical rolls; see figure 13.13. Lorenz used the
continuity equation and Navier-Stokes equations from fluid dynamics, together
with the heat conduction equation to describe the behaviour of one of these rolls.
A series of approximations and simplifications leads to the Lorenz equations

\[
\begin{aligned}
\dot{x} &= \sigma(y - x) \\
\dot{y} &= rx - y - xz \\
\dot{z} &= xy - bz.
\end{aligned}
\tag{13.8}
\]

Figure 13.12 A continuous dynamical system in $\mathbb{R}^3$ induces a discrete dynamical system
on the plane $P$ by the 'first return' map $g$

Figure 13.13 The Lorenz equations describe the behaviour of one of the rotating cylindrical
rolls of heat-conducting viscous fluid (cool upper boundary, warm lower boundary)



The term $x$ represents the rate of rotation of the cylinder, $z$ represents the
deviation from a linear vertical temperature gradient, and $y$ corresponds to the
difference in temperature at opposite sides of the cylinder. The constant $\sigma$ is
the Prandtl number of the air (the Prandtl number involves the viscosity and
thermal conductivity), $b$ depends on the width-to-height ratio of the layer, and $r$
is a control parameter representing the fixed temperature difference between the
bottom and top of the system. The non-linearity in the second and third equations
results from the non-linearity of the equations of flow.

Working in (x, y, z)-space, the first thing to notice is that the system (13.8) 
contracts volumes at a constant rate. The differences in velocity between pairs 
of opposite faces of a small coordinate box of side 8 are approximately 
S(dx/dx), S(dy/dy), S(dz/dz), so the rate of change of volume of the box 
is S 3 ((dx/dx) + (dy/dy) + (dz/dz)) = -(a + b+ 1)<5 3 < 0. Nevertheless, with 
a — 10, b — |, r — 28 (the values usually chosen for study) the trajectories are 
concentrated onto an attractor of a highly complex form. This Lorenz attractor 
consists of two 'discs' each made up of spiralling trajectories (figure 13.14). 
Certain trajectories leave each of the discs almost perpendicularly and flow into 
the other disc. If a trajectory x(t) is computed, the following behaviour is typical. 
As $t$ increases, $x(t)$ circles around one of the discs a number of times and then
'flips' over to the other disc. After a few loops round this second disc, it flips 
back to the original disc. This pattern continues, with an apparently random 
number of circuits before leaving each disc. The motion seems to be chaotic; in 
particular points that are initially close together soon have completely different
patterns of residence in the two discs of the attractor. One interpretation of this
sensitive dependence on initial conditions is that long-term weather prediction
is impossible.

Figure 13.14 A view of the Lorenz attractor for $\sigma = 10$, $b = 8/3$, $r = 28$. Note the
spiralling round the two discs and the 'jumps' from one disc to the other

The Lorenz attractor appears to be a fractal, with numerical estimates suggesting
a dimension of 2.06 when $\sigma = 10$, $b = 8/3$, $r = 28$.
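
The behaviour described above can be explored numerically. The following is a minimal sketch, not a definitive implementation: it integrates (13.8) with a classical fourth-order Runge-Kutta step, where the step size `dt`, the number of steps and the starting point are our own illustrative choices, and the function names are hypothetical. It compares two trajectories whose starting points differ by $10^{-8}$ in the $z$ coordinate.

```python
import math

def lorenz(state, sigma=10.0, r=28.0, b=8.0 / 3.0):
    """Right-hand side of the Lorenz equations (13.8)."""
    x, y, z = state
    return (sigma * (y - x), r * x - y - x * z, x * y - b * z)

def rk4_step(field, state, dt):
    """One classical Runge-Kutta step for the autonomous system state' = field(state)."""
    k1 = field(state)
    k2 = field(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = field(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = field(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(
        s + dt * (a + 2.0 * p + 2.0 * q + d) / 6.0
        for s, a, p, q, d in zip(state, k1, k2, k3, k4)
    )

def trajectory(start, n_steps=4000, dt=0.01):
    """List of points on the numerical trajectory, including the start (assumed step size)."""
    points = [start]
    for _ in range(n_steps):
        points.append(rk4_step(lorenz, points[-1], dt))
    return points

def max_separation(p, q):
    """Largest Euclidean distance between corresponding points of two trajectories."""
    return max(
        math.sqrt(sum((u - v) ** 2 for u, v in zip(a, c))) for a, c in zip(p, q)
    )

orbit = trajectory((1.0, 1.0, 1.0))
nearby = trajectory((1.0, 1.0, 1.0 + 1e-8))
print("largest separation of the two trajectories:", max_separation(orbit, nearby))
```

Counting the sign changes of the $x$ coordinate along `orbit` gives a rough record of the 'flips' between the two discs.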



Other systems of differential equations also have fractal attractors. The
equations

\[
\begin{aligned}
\dot{x} &= -y - z \\
\dot{y} &= x + ay \\
\dot{z} &= b + z(x - c)
\end{aligned}
\]

were studied by Rössler. Fixing suitable $b$ and $c$, the nature of the attractor
changes as $a$ is varied. When $a$ is small the attractor is a simple closed curve,
but on increasing $a$ this splits into a double loop, then a quadruple loop, and so
on. Thus a type of period doubling takes place, and when $a$ reaches a critical
value there is a fractal attractor in the form of a band (figure 13.15). The band
has a twist in it, rather like a Möbius strip.




Figure 13.15 A view of the Rössler band attractor for $a = 0.173$, $b = 0.4$, $c = 8.5$. Note
the banding, suggestive of a set that is locally the product of a Cantor set and a line segment

To a large extent, each continuous dynamical system must be studied individually;
there is little general theory available. Attractors of continuous systems are
well suited to computer study, and mathematicians are frequently challenged to 
explain 'strange' attractors that are observed on computer screens. 



*13.6 Small divisor theory 

There are a number of important dynamical systems dependent on a parameter
$\omega$, which are, in some sense, stable provided that $\omega$ is 'not too close to a rational
number', in other words if $\omega$ is badly approximable in the sense of Section 10.3.
By Jarník's theorem 10.3 the well-approximable numbers form fractal sets, so
the stable parameters lie in sets with fractal complement.

The following simple example indicates how badly approximable parameters 
can result in stable systems. 

Let $C$ be the infinite cylinder of unit radius $\{(\theta, y) : 0 \le \theta < 2\pi,\ -\infty < y < \infty\}$.
Fix $\omega \in \mathbb{R}$ and define a discrete dynamical system $f : C \to C$ by

\[ f(\theta, y) = (\theta + 2\pi\omega \pmod{2\pi},\ y). \tag{13.9} \]

Clearly, $f$ just rotates points on the cylinder through an angle $2\pi\omega$, and the
circles $y = \text{constant}$ are invariant under $f$. It is natural to ask if these invariant
curves are stable: if the transformation (13.9) is perturbed slightly, will the
cylinder still be covered by a family of invariant closed curves (figure 13.16)?





Figure 13.16 (a) Invariant circles for the mapping $f(\theta, y) = (\theta + 2\pi\omega \pmod{2\pi},\ y)$.
(b) If $\omega$ is not 'too rational', a small perturbation of the mapping to $f(\theta, y) = (\theta + 2\pi\omega \pmod{2\pi},\ y + g(\theta))$
distorts the circles to a family of smooth invariant curves

The surprising thing is that this depends on the nature of the number $\omega$: if $\omega$ is
'sufficiently irrational' then invariant curves remain.
We modify transformation (13.9) to

\[ f(\theta, y) = (\theta + 2\pi\omega \pmod{2\pi},\ y + g(\theta)) \tag{13.10} \]

where $g$ is a $C^\infty$ function (i.e. has continuous derivatives of all orders). It is
easy to show, using integration by parts, that a function is $C^\infty$ if and only if its
Fourier coefficients $a_k$ converge to 0 faster than any power of $1/|k|$. Thus, if

\[ g(\theta) = \sum_{k=-\infty}^{\infty} a_k e^{ik\theta} \]

is the Fourier series of $g$, then for every positive integer $m$ there is a constant $c$
such that for $k \neq 0$

\[ |a_k| \le c|k|^{-m}. \]

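Suppose that $y = y(\theta)$ is an invariant curve under (13.10), and suppose that $y$
has Fourier series

\[ y(\theta) = \sum_{k=-\infty}^{\infty} b_k e^{ik\theta}. \]

The point $(\theta + 2\pi\omega \pmod{2\pi},\ y(\theta) + g(\theta))$ must lie on the curve whenever
$(\theta, y(\theta))$ does; hence

\[ y(\theta + 2\pi\omega \pmod{2\pi}) = y(\theta) + g(\theta) \]

or

\[ \sum_{k=-\infty}^{\infty} b_k e^{ik(\theta + 2\pi\omega)} = \sum_{k=-\infty}^{\infty} b_k e^{ik\theta} + \sum_{k=-\infty}^{\infty} a_k e^{ik\theta}. \]

Equating terms in $e^{ik\theta}$ we get that $a_0 = 0$ and $b_0$ is arbitrary, and

\[ b_k = \frac{a_k}{e^{2\pi i k\omega} - 1} \qquad (k \neq 0) \]

provided $\omega$ is irrational. Thus the invariant curves are given by

\[ y(\theta) = b_0 + \sum_{k \neq 0} \frac{a_k}{e^{2\pi i k\omega} - 1}\, e^{ik\theta} \tag{13.11} \]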



provided that this Fourier series converges to a continuous function. This will
happen if the denominators $e^{2\pi i k\omega} - 1$ are not 'too small too often'. Suppose
that $\omega$ is not $\alpha$-well-approximable for some $\alpha > 2$; see (10.4). Then there is a
constant $c_1$ such that

\[ |e^{2\pi i k\omega} - 1| \ge \min_{m \in \mathbb{Z}} |k\omega - m| = \|k\omega\| \ge c_1 |k|^{1-\alpha} \]

for all $k \neq 0$, so

\[ \left| \frac{a_k}{e^{2\pi i k\omega} - 1} \right| \le c_1^{-1} \frac{|a_k|}{|k|^{1-\alpha}} \le c\, c_1^{-1} |k|^{-m-1+\alpha} \]

for some constant $c$ for each $m$. Thus if $g$ is a $C^\infty$ function and $\omega$ is not $\alpha$-well-approximable
for some $\alpha > 2$, the function $y(\theta)$ given by (13.11) is $C^\infty$, so that
$f$ has a family of $C^\infty$ invariant curves. We saw in Theorem 10.3 that the set
of numbers that are $\alpha$-well-approximable for all $\alpha > 2$ has dimension 0, so for
'most' $\omega$ the invariant curves are stable.
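
The construction of the invariant curve can be checked numerically when $g$ is a trigonometric polynomial, so that the sums are finite. The sketch below is illustrative only: the choice $g(\theta) = \cos\theta + \tfrac{1}{2}\sin 2\theta$, the value $\omega = (\sqrt{5} - 1)/2$ and the function names are our assumptions. It forms $b_k = a_k/(e^{2\pi i k\omega} - 1)$ and verifies that $y(\theta + 2\pi\omega) = y(\theta) + g(\theta)$ at a few sample angles.

```python
import cmath
import math

def invariant_curve_coefficients(a, omega, b0=0.0):
    """Fourier coefficients b_k of the invariant curve, given those of g (with a_0 = 0)."""
    if abs(a.get(0, 0.0)) > 1e-15:
        raise ValueError("an invariant curve requires a_0 = 0")
    b = {0: complex(b0)}
    for k, a_k in a.items():
        if k != 0:
            # The 'small divisor': close to zero when k*omega is near an integer.
            b[k] = a_k / (cmath.exp(2j * math.pi * k * omega) - 1.0)
    return b

def evaluate(coeffs, theta):
    """Sum of the finite Fourier series with the given coefficients at angle theta."""
    return sum(c * cmath.exp(1j * k * theta) for k, c in coeffs.items())

# Coefficients of g(theta) = cos(theta) + 0.5*sin(2*theta) (an assumed example).
a = {1: 0.5, -1: 0.5, 2: -0.25j, -2: 0.25j}
omega = (math.sqrt(5.0) - 1.0) / 2.0
b = invariant_curve_coefficients(a, omega)

for theta in (0.0, 0.7, 2.0, 5.5):
    lhs = evaluate(b, theta + 2.0 * math.pi * omega)
    rhs = evaluate(b, theta) + evaluate(a, theta)
    print(theta, abs(lhs - rhs))
```

Replacing `omega` by a value very close to a rational with small denominator makes some of the `b[k]` very large, which is the numerical face of the small divisor problem.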

The above example is a special case of a much more general class of transformations
of the cylinder known as twist maps. Define $f : C \to C$ by

\[ f(\theta, y) = (\theta + 2\pi\omega(y) \pmod{2\pi},\ y). \tag{13.12} \]

Again the circles $y = \text{constant}$ are invariant, but this time the angle of rotation
$\omega(y)$ is allowed to vary smoothly with $y$. We perturb $f$ to

\[ f(\theta, y) = (\theta + 2\pi\omega(y) + \epsilon h(\theta, y) \pmod{2\pi},\ y + \epsilon g(\theta, y)) \tag{13.13} \]

where $h$ and $g$ are smooth functions and $\epsilon$ is small, and ask if the invariant
curves round $C$ are preserved. Moser's twist theorem, a very deep result, roughly
says that the invariant circles $y = \text{constant}$ of (13.12), for which $\omega(y) = \omega$, will
deform into differentiable closed invariant curves of (13.13) if $\epsilon$ is small enough,
provided that $\|k\omega\| \ge c_1|k|^{-3/2}$ for all $k \neq 0$ for some constant $c_1$. Thus the
exceptional set of frequencies $\omega$ has dimension $\tfrac{4}{5}$, by Theorem 10.3. Typically
$C$ is filled by invariant curves corresponding to badly approximable $\omega$, where
the motion is regular, and regions in between where the motion is chaotic. The
chaotic regions grow as $\epsilon$ increases.

Perhaps the most important application of small divisor theory is to the stability
of Hamiltonian systems. Consider a 4-dimensional space parametrized by
$(\theta_1, \theta_2, j_1, j_2)$. A Hamiltonian function $H(\theta_1, \theta_2, j_1, j_2)$ determines a conservative
(volume-preserving) dynamical system according to the differential equations

\[ \dot{\theta}_1 = \partial H/\partial j_1, \qquad \dot{\theta}_2 = \partial H/\partial j_2, \qquad \dot{j}_1 = -\partial H/\partial\theta_1, \qquad \dot{j}_2 = -\partial H/\partial\theta_2. \]

Thus if $H(\theta_1, \theta_2, j_1, j_2) = H_0(j_1, j_2)$ is independent of $\theta_1, \theta_2$, we get the solution

\[ \theta_1 = \omega_1 t + c_1, \qquad \theta_2 = \omega_2 t + c_2, \qquad j_1 = c_3, \qquad j_2 = c_4 \]

where $\omega_1$ and $\omega_2$ are angular frequencies (which may depend on $j_1, j_2$) and
$c_1, \ldots, c_4$ are constants. A trajectory of the system remains on the same 2-dimensional
torus, $(j_1, j_2) = \text{constant}$, for all time; such tori are called invariant.
It is important to know whether such invariant tori are stable under small
perturbations of the system. If the Hamiltonian is replaced by

\[ H_0(j_1, j_2) + \epsilon H_1(\theta_1, \theta_2, j_1, j_2) \]

where $\epsilon$ is small, do the trajectories of this new system stay on new invariant
tori expressible as $(j_1', j_2') = \text{constant}$, after a suitable coordinate transformation
$(\theta_1, \theta_2, j_1, j_2) \mapsto (\theta_1', \theta_2', j_1', j_2')$? In other words, do the invariant tori of the original
system distort slightly to become invariant tori for the new system, or do
they break up altogether? The celebrated Kolmogorov-Arnold-Moser (KAM)
theorem gives an answer to this question: essentially a torus is stable under
sufficiently small perturbations provided that the frequency ratio $\omega_1/\omega_2$ is badly
approximable by rationals; more precisely it is stable if for some $c > 0$ we have
$|(\omega_1/\omega_2) - (p/q)| \ge c/q^{5/2}$ for all positive integers $p, q$. The set of numbers that
fails to satisfy this condition is a fractal of dimension $\tfrac{4}{5}$ by Theorem 10.3, so, in
particular, almost all frequency ratios (in the sense of Lebesgue measure) have
tori that are stable under sufficiently small perturbations. (In fact, the condition
can be weakened to $|(\omega_1/\omega_2) - (p/q)| \ge c/q^{\alpha}$ for any $\alpha > 2$.)
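
To get a feel for the condition $|(\omega_1/\omega_2) - (p/q)| \ge c/q^{5/2}$, one can compute the smallest value of $q^{5/2}|\omega - p/q|$ over a finite range of denominators. The sketch below is only a finite check under our own assumptions (the cut-off `q_max`, the two test numbers and the function name are illustrative); it cannot establish the condition for all $q$, but it shows the contrast between the golden ratio $(\sqrt{5} - 1)/2$ and a number lying very close to $1/3$.

```python
import math

def small_divisor_constant(omega, q_max=1000, exponent=2.5):
    """Smallest value of q**exponent * |omega - p/q| over 1 <= q <= q_max.

    For each q only the nearest integer p to q*omega needs to be examined.
    """
    best = float("inf")
    for q in range(1, q_max + 1):
        p = round(q * omega)
        best = min(best, q ** exponent * abs(omega - p / q))
    return best

golden = (math.sqrt(5.0) - 1.0) / 2.0
near_rational = 1.0 / 3.0 + 1e-9

print("golden ratio: ", small_divisor_constant(golden))
print("near 1/3:     ", small_divisor_constant(near_rational))
```

A value bounded well away from zero is consistent with a usable constant $c$; a value close to zero signals a frequency ratio that is 'too close to a rational' at the scale examined.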

There is some astronomical evidence for small divisor theory. For example, the 
angular frequencies $\omega$ of asteroids tend to avoid values for which the ratio $\omega/\omega_J$
is close to $p/q$ where $q$ is a small integer, where $\omega_J$ is the angular frequency
of Jupiter, the main perturbing influence. On the assumptions that orbits in the 
solar system are stable (which, fortunately, seems to be the case) and that we can 
consider a pair of orbiting bodies in isolation (a considerable oversimplification), 
this avoidance of rational frequency ratios is predicted by KAM theory. 



*13.7 Liapounov exponents and entropies 

So far we have looked at attractors of dynamical systems largely from a geometric 
point of view. However, a dynamical system $f$ provides a much richer structure
than a purely geometric one. In this section we outline some properties of $f$ that
often go hand in hand with fractal attractors. 

The concept of invariant measures is fundamental in dynamical systems theory.
A measure $\mu$ on $D$ is invariant for a mapping $f : D \to D$ if for every subset $A$
of $D$ we have

\[ \mu(f^{-1}(A)) = \mu(A). \tag{13.14} \]

We assume that $\mu$ has been normalized so that $\mu(D) = 1$. Any attractor $F$ supports
at least one invariant measure: for fixed $x$ in the basin of attraction of $F$
and $A$ a Borel set, write

\[ \mu(A) = \lim_{m \to \infty} \frac{1}{m} \#\{k : 1 \le k \le m \text{ and } f^k(x) \in A\} \tag{13.15} \]

for the proportion of iterates in $A$. It may be shown using ergodic theory that
this limit exists and is the same for $\mu$-almost all points in the basin of attraction
under very general circumstances. Clearly, $\mu(A \cup B) = \mu(A) + \mu(B)$ if $A$ and
$B$ are disjoint, and $f^k(x) \in A$ if and only if $f^{k-1}(x) \in f^{-1}(A)$, giving (13.14).
The measure (13.15) is concentrated on the set of points to which $f^k(x)$ comes
arbitrarily close infinitely often; thus $\mu$ is supported by an attractor of $f$. The
measure $\mu(A)$ reflects the proportion of the iterates that lie in $A$, and may be
thought of as the distribution that is seen when a large number of iterates $f^k(x)$
are plotted on a computer screen. The intensity of the measure can vary widely
across the attractor $F$; this variation is often analysed using multifractal analysis
(see Chapter 17). As far as the size of an attractor is concerned, it is often the
dimension of the set occupied by the invariant measure $\mu$ that is relevant, rather
than the entire attractor. With this in mind, we define the Hausdorff dimension
of a measure $\mu$ for which $\mu(D) = 1$ as

\[ \dim_H \mu = \inf\{\dim_H E : E \text{ is a Borel set with } \mu(E) > 0\}. \tag{13.16} \]
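
The limit (13.15) is easy to approximate on a computer. As a hedged illustration we take the logistic map $f(x) = 4x(1 - x)$ on $[0, 1]$, which is our choice of example rather than one singled out in the text; for this map the absolutely continuous invariant measure is known to satisfy $\mu([0, t]) = (2/\pi)\arcsin\sqrt{t}$, so that $\mu([0, \tfrac{1}{4}]) = \tfrac{1}{3}$. The starting point, burn-in length and number of iterates below are arbitrary, and floating-point orbits only approximate true orbits.

```python
import math

def logistic(x, lam=4.0):
    """The logistic map x -> lam * x * (1 - x)."""
    return lam * x * (1.0 - x)

def proportion_of_iterates(f, x0, in_A, m=200000, burn_in=1000):
    """Finite-m approximation to (13.15): fraction of m iterates of x0 that fall in A.

    in_A is a predicate describing the set A; burn_in iterates are discarded first.
    """
    x = x0
    for _ in range(burn_in):
        x = f(x)
    hits = 0
    for _ in range(m):
        x = f(x)
        if in_A(x):
            hits += 1
    return hits / m

estimate = proportion_of_iterates(logistic, 0.2, lambda x: x <= 0.25)
exact = (2.0 / math.pi) * math.asin(math.sqrt(0.25))
print("estimated mu([0, 1/4]):", estimate, " closed form:", exact)
```

Repeating the computation with a different generic starting point should give nearly the same proportion, in line with the statement that the limit is the same for almost all points.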

If $\mu$ is supported by $F$ then clearly $\dim_H \mu \le \dim_H F$, but we may have strict
inequality; see Exercise 13.10. However, if there are numbers $s > 0$ and $c > 0$
such that for every set $U$

\[ \mu(U) \le c|U|^s \tag{13.17} \]

then the Mass distribution principle 4.2 implies that for each set $E$ with $0 < \mu(E)$
we have $\mathcal{H}^s(E) \ge \mu(E)/c > 0$, so that $\dim_H E \ge s$. Hence if (13.17) holds then

\[ \dim_H \mu \ge s. \tag{13.18} \]

Once $f$ is equipped with an invariant measure $\mu$ several other dynamical
constants may be defined. For simplicity, we assume that $D$ is a domain in $\mathbb{R}^2$
and $f : D \to D$ is differentiable. The derivative $(f^k)'(x)$ is a linear mapping on
$\mathbb{R}^2$; we write $a_k(x)$ and $b_k(x)$ for the lengths of the major and minor semi-axes
of the ellipse $(f^k)'(x)(B)$ where $B$ is the unit ball. Thus the image under $f^k$ of
a small ball of radius $r$ and centre $x$ approximates to an ellipse with semi-axes
of lengths $r a_k(x)$ and $r b_k(x)$. We define the Liapounov exponents as the average
logarithmic rate of growth with $k$ of these semi-axes:

\[ \lambda_1(x) = \lim_{k \to \infty} \frac{1}{k} \log a_k(x), \qquad \lambda_2(x) = \lim_{k \to \infty} \frac{1}{k} \log b_k(x). \tag{13.19} \]

Techniques from ergodic theory show that if $\mu$ is invariant for $f$, these exponents
exist and have the same values $\lambda_1, \lambda_2$ for $\mu$-almost all $x$. Hence in a system with
an invariant measure, we refer to $\lambda_1$ and $\lambda_2$ as the Liapounov exponents of the
system. The Liapounov exponents represent the 'average' rates of expansion of
$f$. If $B$ is a disc of small radius $r$, then $f^k(B)$ will 'typically' be close to an
ellipse with semi-axes of lengths $r e^{\lambda_1 k}$ and $r e^{\lambda_2 k}$; see figure 13.17.

Figure 13.17 The definition of the Liapounov exponents $\lambda_1$ and $\lambda_2$

A related dynamical idea is the entropy of a mapping $f : D \to D$. Write

\[ V(x, \epsilon, k) = \{y \in D : |f^i(x) - f^i(y)| < \epsilon \text{ for } 0 \le i \le k\} \tag{13.20} \]

for the set of points with their first $k$ iterates within $\epsilon$ of those of $x$. If $\mu$ is an
invariant measure for $f$, we define the $\mu$-entropy of $f$ as

\[ h_\mu(f) = \lim_{\epsilon \to 0} \lim_{k \to \infty} \left( -\frac{1}{k} \log \mu(V(x, \epsilon, k)) \right). \tag{13.21} \]

Under reasonable conditions, this limit exists and has a constant value for $\mu$-almost
all $x$. The entropy $h_\mu(f)$ reflects the rate at which nearby points spread
out under iteration by $f$, or alternatively the amount of extra information about
an orbit $\{f^k(x)\}_{k=1}^{\infty}$ that is gained from knowing the position of an additional
point on the orbit.

The baker's transformation (13.4) provides a simple illustration of these ideas
(the line of discontinuity makes little difference). There is a natural invariant
mass distribution $\mu$ on the attractor $F$ such that each of the $2^k$ strips of $E_k$ has
mass $2^{-k}$, with this mass spread uniformly across the width of $E$. Just as
in Example 4.3 we get that $\mu(U) \le c|U|^s$ where $s = 1 + \log 2/(-\log\lambda)$ so by
(13.17) and (13.18) $s \le \dim_H \mu \le \dim_H F = s$.

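The Liapounov exponents are also easily found. The derivative of (13.4) is

\[ f'(x, y) = \begin{pmatrix} 2 & 0 \\ 0 & \lambda \end{pmatrix} \]

(provided $x \neq \tfrac{1}{2}$) so

\[ (f^k)'(x, y) = \begin{pmatrix} 2^k & 0 \\ 0 & \lambda^k \end{pmatrix} \]

(except where $x = p/2^k$ for non-negative integers $p$ and $k$). Hence $a_k(x, y) = 2^k$ and
$b_k(x, y) = \lambda^k$. By (13.19) $\lambda_1(x, y) = \log 2$, $\lambda_2(x, y) = \log\lambda$ for $\mu$-almost all $(x, y)$, and the
Liapounov exponents of the system are $\lambda_1 = \log 2$ and $\lambda_2 = \log\lambda$.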

Since $f^k$ stretches by a factor $2^k$ horizontally and contracts by a factor $\lambda^k$
vertically, we get, using (13.20) and ignoring the 'cutting' effect of $f$, that
$V((x, y), \epsilon, k)$ is approximately a rectangle with sides $2^{-k}\epsilon$ and $\epsilon$, which has
$\mu$-measure approximately $\epsilon^s 2^{-k}$, if $(x, y) \in F$. Thus

\[ h_\mu(f) = \lim_{\epsilon \to 0} \lim_{k \to \infty} \left( -\frac{1}{k} \log(\epsilon^s 2^{-k}) \right) = \log 2. \]

The dimensions, Liapounov exponents and entropies of an invariant measure
of a given dynamical system can be estimated computationally or experimentally
and are often useful when comparing different systems. However, the very nature
of these quantities suggests that they may not be completely independent of each
other. One relationship that has been derived rigorously applies to a smooth bijective
transformation $f$ on a 2-dimensional surface. If $\mu$ is an invariant measure
for $f$ with Liapounov exponents $\lambda_1 > 0 > \lambda_2$ then

\[ \dim_H \mu = h_\mu(f) \left( \frac{1}{\lambda_1} - \frac{1}{\lambda_2} \right). \tag{13.22} \]

It is easily seen that the exponents calculated for the baker's transformation
satisfy this formula.
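
For the record, the verification for the baker's transformation is a direct substitution of the values found earlier in this section, namely $\lambda_1 = \log 2$, $\lambda_2 = \log\lambda$ (with $0 < \lambda < \tfrac{1}{2}$, so $\lambda_2 < 0$) and $h_\mu(f) = \log 2$:

```latex
\begin{aligned}
h_\mu(f)\left(\frac{1}{\lambda_1} - \frac{1}{\lambda_2}\right)
  &= \log 2 \left(\frac{1}{\log 2} - \frac{1}{\log\lambda}\right) \\
  &= 1 + \frac{\log 2}{-\log\lambda} \\
  &= s = \dim_H \mu .
\end{aligned}
```

This agrees with the value $s = 1 + \log 2/(-\log\lambda)$ obtained for $\dim_H \mu$ from (13.17) and (13.18).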

The following relationship holds in many cases: if $f$ is a plane transformation
with attractor $F$ and Liapounov exponents $\lambda_1 > 0 > \lambda_2$, then

\[ \dim_B F \le 1 - (\lambda_1/\lambda_2). \tag{13.23} \]

An argument to support this runs as follows. Let $N_\delta(F)$ be the least number of
discs of radius $\delta$ that can cover $F$. If $\{U_i\}$ are $N_\delta(F)$ such discs, then $f^k(F)$
is covered by the $N_\delta(F)$ sets $f^k(U_i)$ which are approximately elliptical with
semi-axis lengths $\delta\exp(\lambda_1 k)$ and $\delta\exp(\lambda_2 k)$. These ellipses may be covered by
about $\exp((\lambda_1 - \lambda_2)k)$ discs of radii $\delta\exp(\lambda_2 k)$. Hence, since the attractor
satisfies $f^k(F) = F$,

\[ N_{\delta\exp(\lambda_2 k)}(F) \le \exp((\lambda_1 - \lambda_2)k)\, N_\delta(F) \]

so

\[ \frac{\log N_{\delta\exp(\lambda_2 k)}(F)}{-\log(\delta\exp(\lambda_2 k))} \le \frac{\log\bigl(\exp((\lambda_1 - \lambda_2)k)\, N_\delta(F)\bigr)}{-\log(\delta\exp(\lambda_2 k))} = \frac{(\lambda_1 - \lambda_2)k + \log N_\delta(F)}{-\lambda_2 k - \log\delta}. \]

Letting $k \to \infty$ gives $\dim_B F \le 1 - (\lambda_1/\lambda_2)$. This argument can often be justified,
but it assumes that the Liapounov exponents are constant across the domain
$D$, which need not be the case.
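
The quantities in (13.23) can be estimated numerically. The sketch below is a standard numerical approach rather than anything prescribed in the text: it follows an orthonormal frame along an orbit, applies the derivative at each step and re-orthonormalizes by Gram-Schmidt, averaging the logarithms of the stretch factors. We assume the Hénon mapping in the form $f(x, y) = (y + 1 - ax^2,\ bx)$ with $a = 1.4$, $b = 0.3$; the starting point, the iteration counts and the function name are illustrative.

```python
import math

def henon_liapounov(a=1.4, b=0.3, n_iter=20000, n_skip=1000):
    """Estimate the two Liapounov exponents of the assumed Henon map along one orbit."""
    x, y = 0.0, 0.0
    for _ in range(n_skip):  # discard a transient so the orbit is near the attractor
        x, y = y + 1.0 - a * x * x, b * x

    u, v = (1.0, 0.0), (0.0, 1.0)  # orthonormal frame carried along the orbit
    sum1 = sum2 = 0.0
    for _ in range(n_iter):
        # Derivative of the map at (x, y) is [[-2*a*x, 1], [b, 0]].
        j11, j12, j21, j22 = -2.0 * a * x, 1.0, b, 0.0
        u = (j11 * u[0] + j12 * u[1], j21 * u[0] + j22 * u[1])
        v = (j11 * v[0] + j12 * v[1], j21 * v[0] + j22 * v[1])
        # Gram-Schmidt: normalize u, then remove its component from v.
        norm_u = math.hypot(u[0], u[1])
        u = (u[0] / norm_u, u[1] / norm_u)
        dot = u[0] * v[0] + u[1] * v[1]
        v = (v[0] - dot * u[0], v[1] - dot * u[1])
        norm_v = math.hypot(v[0], v[1])
        v = (v[0] / norm_v, v[1] / norm_v)
        sum1 += math.log(norm_u)
        sum2 += math.log(norm_v)
        x, y = y + 1.0 - a * x * x, b * x
    return sum1 / n_iter, sum2 / n_iter

lam1, lam2 = henon_liapounov()
print("lambda_1 ~", lam1, " lambda_2 ~", lam2)
print("1 - lambda_1/lambda_2 ~", 1.0 - lam1 / lam2)
```

Since the derivative of this map has determinant $-b$ everywhere, the two estimates must sum to $\log b$, which provides a simple consistency check on the computation.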

The relationships between these and other dynamical parameters are complex,
being closely interrelated with the chaotic properties of $f$ and the fractal nature
of the attractor.

The theory of multifractal measures has been introduced to analyse measures
such as the invariant measures of dynamical systems. This is discussed
in Chapter 17.

13.8 Notes and references 

The literature on dynamical systems is vast. Broad accounts that relate to topics
discussed in this chapter may be found in the books by Guckenheimer and Holmes
(1993), Schuster (1984), Thompson and Stewart (1986), Devaney (1989), Hale
and Koçak (1991), Peitgen, Jürgens and Saupe (1992), Drazin (1992) and Katok
and Hasselblatt (1995). More computational approaches are given by Stuart and
Humphries (1996) and Nusse and Yorke (1998). The collections of papers edited
by Cvitanović (1989), Holden (1986), Bedford and Swift (1988) and Ledrappier,
Lewowicz and Newhouse (1996) highlight a variety of related aspects.

For more details on symbolic dynamics, see Bedford, Keane and Series (1991), 
Lind and Marcus (1995) and Pesin (1997). For accounts of the logistic map, see 
May (1976), and most of the books mentioned above. The horseshoe attractor was
introduced in the fundamental paper by Smale (1967) and the Hénon attractor in
Hénon and Pomeau (1976). The book by Sparrow (1982) contains a full account
of the Lorenz equations. 

The main theory and applications of small divisor theory are brought together 
in the collected papers on Hamiltonian dynamical systems edited by MacKay 
and Meiss (1987). For results relating Liapounov exponents to dimensions, see 
the papers by Young (1982), Frederickson, Kaplan, Yorke and Yorke (1983), the 
papers in Mayer-Kress (1986), and the books by Temam (1997) and Pesin (1997). 



Exercises 

13.1 Find a fractal invariant set $F$ for the 'tent-like' map $f : \mathbb{R} \to \mathbb{R}$ given by $f(x) =
2(1 - |2x - 1|)$. Show that $F$ is a repeller for $f$ and that $f$ is chaotic on $F$. What
is $\dim_H F$?

13.2 Let $f : \mathbb{R} \to \mathbb{R}$ be given by

\[ f(x) = \begin{cases} 5x & (x \le 1) \\ 10 - 5x & (1 < x < 2) \\ 5x - 10 & (x \ge 2) \end{cases} \]

Determine an IFS $S_1, S_2, S_3 : [0, 5] \to [0, 5]$ such that $f(S_i(x)) = x$ for each $i$.
Show that the attractor $F$ of this IFS is a repeller for $f$, and determine the Hausdorff
and box dimension of $F$.

13.3 Let $f_\lambda : [0, 1] \to \mathbb{R}$ be given by $f_\lambda(x) = \lambda\sin\pi x$. Show that for $\lambda$ sufficiently
large, $f_\lambda$ has a repeller $F$, in the sense that if $x \notin F$ then $f_\lambda^k(x) \notin [0, 1]$ for some
positive integer $k$. Find an IFS which has $F$ as its attractor, and thus estimate
$\dim_H F$ for large $\lambda$ (see Example 9.8).

13.4 Investigate the iterates $f_\lambda^k(x)$ of $x$ in $[0, 1]$ under the logistic mapping (13.2) for
various values of $\lambda$ and initial points $x$. Show that if the sequence of iterates
converges then it converges either to 0 or to $1 - 1/\lambda$. Show that if $\lambda = \tfrac{1}{2}$ then,
for all $x$ in $(0, 1)$, the iterates converge to 0, but that if $\lambda = 2$ they converge to
$\tfrac{1}{2}$. Show that if $\lambda = 4$, then there are infinitely many values of $x$ in $(0, 1)$ such
that $f_\lambda^k(x)$ converges to 0, infinitely many $x$ in $(0, 1)$ for which $f_\lambda^k(x)$ converges
to $\tfrac{3}{4}$, and infinitely many $x$ in $(0, 1)$ for which $f_\lambda^k(x)$ does not converge. Use a
programmable calculator or computer to investigate the behaviour of the orbits for
other values of $\lambda$. Investigate other transformations listed at the end of Section 13.2
in a similar way.

13.5 In the cases $\lambda = 2$ and $\lambda = 4$ it is possible to obtain a simple formula for the
iterates of the logistic map $f_\lambda$ on $[0, 1]$. For a given $x = x_0$ we write $x_k = f_\lambda^k(x)$.

(i) Show that if $\lambda = 2$ and $a$ is chosen so that $x = \tfrac{1}{2}(1 - \exp a)$, then the iterates
are given by $x_k = \tfrac{1}{2}(1 - \exp(2^k a))$.

(ii) Show that if $\lambda = 4$ and $0 \le a < 1$ is chosen so that $x = \sin^2(\pi a)$, then $x_k =
\sin^2(2^k \pi a)$. By writing $a = 0 \cdot a_1 a_2 \ldots$ in binary form, show that $f_4$ has an
unstable orbit of period $p$ for all positive integers $p$ and also has a dense orbit.

13.6 Consider the modified baker's transformation $f : E \to E$, where $E$ is the unit
square, given by

\[ f(x, y) = \begin{cases} (2x,\ \lambda y) & (0 \le x \le \tfrac{1}{2}) \\ (2x - 1,\ \mu y + \tfrac{1}{2}) & (\tfrac{1}{2} < x \le 1) \end{cases} \]

where $0 < \lambda, \mu < \tfrac{1}{2}$. Show that there is a set $F$ that attracts all points of $E$, and
find an expression for the Hausdorff dimension of $F$.

13.7 Consider the Hénon mapping (13.5) with $a = 1.4$ and $b = 0.3$. Show that
the quadrilateral $D$ with vertices $(1.32, 0.133)$, $(-1.33, 0.42)$, $(-1.06, -0.5)$ and
$(1.245, -0.14)$ is mapped into itself by $f$. Use a computer to plot the iterates of a
typical point in $D$.

13.8 With notation as in Section 13.4 consider the transformation $f$ of the solid torus
$D$ given by

\[ f(\phi, w) = (3\phi \pmod{2\pi},\ aw + \tfrac{1}{2}\hat{\phi}) \]

where $0 < a < \tfrac{1}{10}$. Show that $f$ has an attractor $F$ of Hausdorff and box dimensions
equal to $1 + \log 3/(-\log a)$, and verify that $f$ is chaotic on $F$.

13.9 Let $g : \mathbb{R} \to \mathbb{R}$ be a differentiable bounded function, and let $h : \mathbb{R}^2 \to \mathbb{R}^2$ be
given by

\[ h(t, x) = (\lambda t,\ \lambda^{2-s}(x - g(t))) \]

where $\lambda > 1$ and $0 < s < 2$. Show that graph $f$ is a repeller for $h$, where $f$ is
the function

\[ f(t) = \sum_{k=0}^{\infty} \lambda^{(s-2)k} g(\lambda^k t). \]

Thus functions of Weierstrass type (see (11.7)) can occur as invariant sets in dynamical
systems.

13.10 Give an example of a mass distribution $\mu$ on $[0, 1]$ for which $\dim_H \mu < \dim_H F$,
where $F$ is the support of $\mu$. (Hint: see Section 10.1.)

13.11 Consider the mapping $f : E \to E$, where $E$ is the unit square, given by

\[ f(x, y) = (x + y \pmod{1},\ x + 2y \pmod{1}). \]

(This mapping has become known as Arnold's cat map.) Show that plane Lebesgue
measure is invariant for $f$ (i.e. $f$ is area-preserving), and find the Liapounov
exponents of $f$.

13.12 Write a computer program that plots the orbits of a point $x$ under iteration by
a mapping of a region in the plane. Use it to study the attractors of the baker's
transformation and the Hénon mapping, and experiment with other functions.

13.13 Write a computer program to draw trajectories of the Lorenz equations (13.8). See
how the trajectories change as $\sigma$, $r$ and $b$ are varied. Do a similar study for the
Rössler equations.



Chapter 14 Iteration of complex 
functions — Julia sets 



Julia sets provide a most striking illustration of how an apparently simple process
can lead to highly intricate sets. Functions on the complex plane $\mathbb{C}$ as
simple as $f(z) = z^2 + c$, with $c$ a constant, can give rise to fractals of an exotic
appearance; look ahead to figure 14.7.

Julia sets arise in connection with the iteration of a function of a complex
variable $f$, so are related to the dynamical systems discussed in the previous
chapter; in general a Julia set is a dynamical repeller. However, by specializing
to functions that are analytic on the complex plane (i.e. differentiable in the sense
that $f'(z) = \lim_{w \to 0} (f(z + w) - f(z))/w$ exists as a complex number, where
$z, w \in \mathbb{C}$) we can use the powerful techniques of complex variable theory to
obtain much more detailed information about the structure of such repelling sets.



14.1 General theory of Julia sets 

For convenience of exposition, we take $f : \mathbb{C} \to \mathbb{C}$ to be a polynomial of degree
$n \ge 2$ with complex coefficients, $f(z) = a_n z^n + a_{n-1} z^{n-1} + \cdots + a_0$. Note that
with minor modifications the theory remains true if $f$ is a rational function
$f(z) = p(z)/q(z)$ (where $p, q$ are polynomials) on the extended complex plane
$\mathbb{C} \cup \{\infty\}$, and much of it holds if $f$ is any meromorphic function (that is, a
function that is analytic on $\mathbb{C}$ except at isolated poles).

As usual we write $f^k$ for the $k$-fold composition $f \circ \cdots \circ f$ of the function $f$,
so that $f^k(w)$ is the $k$th iterate $f(f(\cdots(f(w))\cdots))$ of $w$.

Julia sets are defined in terms of the behaviour of the iterates $f^k(z)$ for large
$k$. First, we define the filled-in Julia set of the polynomial $f$,

\[ K(f) = \{z \in \mathbb{C} : f^k(z) \not\to \infty\}. \]

The Julia set of $f$ is the boundary of the filled-in Julia set, $J(f) = \partial K(f)$. (We
write $K$ for $K(f)$ and $J$ for $J(f)$ when the function is clear.) Thus $z \in J(f)$
if in every neighbourhood of $z$ there are points $w$ and $v$ with $f^k(w) \to \infty$ and
$f^k(v) \not\to \infty$.

The complement of the Julia set is called the Fatou set or stable set F(f). This 
chapter investigates the geometry and structure of the Julia sets of polynomials; 
in particular J is usually a fractal. 

For the simplest example, let $f(z) = z^2$, so that $f^k(z) = z^{2^k}$. Clearly $f^k(z) \to 0$
as $k \to \infty$ if $|z| < 1$ and $f^k(z) \to \infty$ if $|z| > 1$, but with $f^k(z)$ remaining on the
circle $|z| = 1$ for all $k$ if $|z| = 1$. Thus, the filled-in Julia set $K$ is the unit disc
$|z| \le 1$, and the Julia set $J$ is its boundary, the unit circle, $|z| = 1$. The Julia set $J$
is the boundary between the sets of points which iterate to 0 and $\infty$. Of course, in
this special case $J$ is not a fractal.

Suppose that we modify this example slightly, taking $f(z) = z^2 + c$ where $c$
is a small complex number. It is easy to see that we still have $f^k(z) \to w$ if $z$
is small, where $w$ is the fixed point of $f$ close to 0, and that $f^k(z) \to \infty$ if $z$ is
large. Again, the Julia set is the boundary between these two types of behaviour,
but it turns out that now $J$ is a fractal curve; see figure 14.1.

We shall need some terminology about fixed and periodic points of $f$. Recall
that if $f(w) = w$ we call $w$ a fixed point of $f$, and if $f^p(w) = w$ for some integer
$p \ge 1$ we call $w$ a periodic point of $f$; the least such $p$ is called the period of
$w$. We call $w, f(w), \ldots, f^p(w)$ a period $p$ orbit. Let $w$ be a periodic point of
period $p$, with $(f^p)'(w) = \lambda$, where the prime denotes complex differentiation.
The point $w$ is called attractive if $0 \le |\lambda| < 1$, in which case nearby points are
attracted to the orbit under iteration by $f$, and repelling if $|\lambda| > 1$, in which case
points close to the orbit move away. The study of sequences $f^k(z)$ for various
initial $z$ is known as complex dynamics. The position of $z$ relative to the Julia set
$J(f)$ is a key to this behaviour.





Figure 14.1 (a) The Julia set of $f(z) = z^2$ is the circle $|z| = 1$, with the iterates $f^k(z) \to 0$
if $z$ is inside $J$, and $|f^k(z)| \to \infty$ if $z$ is outside $J$. (b) If $f$ is perturbed to the function
$f(z) = z^2 + c$ for small $c$ this picture distorts slightly, with a curve $J$ separating those
points $z$ for which $f^k(z)$ converges to the fixed point $w$ of $f$ near 0 from those points $z$
with $|f^k(z)| \to \infty$. The curve $J$ is now a fractal

The following lemma is extremely useful in determining whether a sequence
iterates to infinity, that is whether points are outside the filled-in Julia set.

Lemma 14.1

Given a polynomial $f(z) = a_n z^n + a_{n-1} z^{n-1} + \cdots + a_0$ with $a_n \neq 0$, there exists
a number $r$ such that if $|z| \ge r$ then $|f(z)| \ge 2|z|$. In particular, if $|f^m(z)| \ge r$
for some $m \ge 0$ then $f^k(z) \to \infty$ as $k \to \infty$. Thus either $f^k(z) \to \infty$ or
$\{f^k(z) : k = 0, 1, 2, \ldots\}$ is a bounded set.

Proof. We may choose $r$ sufficiently large to ensure that if $|z| \ge r$ then $\tfrac{1}{2}|a_n||z|^n \ge
2|z|$ and $(|a_{n-1}||z|^{n-1} + \cdots + |a_1||z| + |a_0|) \le \tfrac{1}{2}|a_n||z|^n$. Then if $|z| \ge r$,

\[ |f(z)| \ge |a_n||z|^n - (|a_{n-1}||z|^{n-1} + \cdots + |a_1||z| + |a_0|) \ge \tfrac{1}{2}|a_n||z|^n \ge 2|z|. \]

Furthermore, if $|f^m(z)| \ge r$ for some $m$, then, applying this inductively, we get
$|f^{m+k}(z)| \ge 2^k |f^m(z)| \ge r$, so $f^k(z) \to \infty$. □
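
Lemma 14.1 underlies the usual computer test for membership of the filled-in Julia set. For $f(z) = z^2 + c$ an explicit radius is available: $|z^2 + c| \ge |z|^2 - |c| \ge 2|z|$ whenever $|z| \ge 1 + \sqrt{1 + |c|}$, so we may take $r = 1 + \sqrt{1 + |c|}$. The sketch below uses this $r$; the function names and the cut-off `max_iter` are our own assumptions, and a return value of `True` means only that the orbit has not left the disc of radius $r$ within `max_iter` steps, not that $z$ has been proved to lie in $K$.

```python
import math

def escape_radius(c):
    """A radius r for f(z) = z*z + c such that |z| >= r implies |f(z)| >= 2|z|."""
    return 1.0 + math.sqrt(1.0 + abs(c))

def stays_bounded(z, c, max_iter=200):
    """Return False as soon as an iterate leaves the disc of radius r.

    By Lemma 14.1 such a point iterates to infinity, so it lies outside K.
    Returning True is only evidence, up to max_iter iterations, that z is in K.
    """
    r = escape_radius(c)
    for _ in range(max_iter):
        if abs(z) >= r:
            return False
        z = z * z + c
    return True

print(stays_bounded(0.5, 0.0))    # inside the unit disc for f(z) = z^2
print(stays_bounded(1.5, 0.0))    # outside the unit disc for f(z) = z^2
print(stays_bounded(0.0, -1.0))   # 0 -> -1 -> 0 is a period 2 orbit of z^2 - 1
```

Applying `stays_bounded` to each point of a grid in the plane gives a crude picture of $K$, whose boundary is the Julia set.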

Some basic observations on the structure of the filled-in Julia set and the Julia 
set follow easily. 

Proposition 14.2 

Let $f(z)$ be a polynomial. Then the filled-in Julia set $K(f)$ and the Julia set
$J(f)$ are non-empty and compact, with $J(f) \subset K(f)$. Furthermore, $J(f)$ has
an empty interior.

Proof. With $r$ given by Lemma 14.1, it is immediate from the lemma that $K$ is
contained in the disc $B(0, r)$ and so is bounded, as is its boundary $J$.

If $z \notin K$, then $f^k(z) \to \infty$, so $|f^m(z)| > r$ for some integer $m$. By continuity
of $f^m$, $|f^m(w)| > r$ for all $w$ in a sufficiently small disc centred at $z$, so for such
$w$, $f^k(w) \to \infty$ by Lemma 14.1, giving that $w \notin K$. Thus, the complement of
$K$ is open, so $K$ is closed. As the boundary of $K$, the Julia set $J$ is closed and
contained in $K$. Thus $K$ and $J$ are closed and bounded, and so are compact.

The equation $f(z) = z$ has at least one solution $z_0$, say, so $f^k(z_0) = z_0$ for all
$k$, so $z_0 \in K$ and $K$ is non-empty. Let $z_1 \in \mathbb{C} \setminus K$. Then the point $\lambda z_0 + (1 - \lambda)z_1$
on the line joining $z_0$ and $z_1$ will lie on the boundary of $K$ for some $0 \le \lambda \le 1$;
taking $\lambda$ as the infimum value for which $\lambda z_0 + (1 - \lambda)z_1 \in K$ will do. Thus,
$J = \partial K$ is non-empty.

Finally, if $U$ is a non-empty open subset of $J \subset K$, then $U$ lies in the interior
of $K$ and therefore has empty intersection with its boundary $J$, a contradiction. □

Hardly surprisingly, $J(f)$ gets mapped onto itself by $f$ and its inverse.

Proposition 14.3

The Julia set $J = J(f)$ of $f$ is forward and backward invariant under $f$, i.e. $J =
f(J) = f^{-1}(J)$.

Proof. Let $z \in J$. Then $f^k(z) \not\to \infty$, and we may find $w_n \to z$ with $f^k(w_n) \to
\infty$ as $k \to \infty$ for all $n$. Thus, $f^k(f(z)) \not\to \infty$ and $f^k(f(w_n)) \to \infty$, where by
continuity of $f$, $f(w_n)$ can be chosen as close as we like to $f(z)$. Thus $f(z) \in J$,
so $f(J) \subset J$ which also implies $J \subset f^{-1}(f(J)) \subset f^{-1}(J)$.

Similarly, with $z$ and $w_n$ as above, if $f(z_0) = z$ then we may find $v_n \to z_0$ with
$f(v_n) = w_n$, by the mapping properties of polynomials on $\mathbb{C}$. Hence, $f^k(z_0) =
f^{k-1}(z) \not\to \infty$ and $f^k(v_n) = f^{k-1}(w_n) \to \infty$ as $k \to \infty$, so $z_0 \in J$. Thus,
$f^{-1}(J) \subset J$ which implies $J = f(f^{-1}(J)) \subset f(J)$. □



Proposition 14.4 

$J(f^p) = J(f)$ for every positive integer $p$.

Proof. It follows from Lemma 14.1 that $f^k(z) \to \infty$ if and only if $(f^p)^k(z) =
f^{kp}(z) \to \infty$. Thus $f$ and $f^p$ have identical filled-in Julia sets and so identical
Julia sets. □

To develop the theory of Julia sets much further, we cannot avoid introducing 
some technical tools from complex variable theory, in particular normal families 
of analytic functions and Montel's theorem. 

[*Readers who wish to omit the quite technical arguments involving complex 
variable theory should skip to Summary 14.12.] 

Let $U$ be an open subset of $\mathbb{C}$, and let $g_k : U \to \mathbb{C}$, $k = 1, 2, \ldots$ be a family
of complex analytic functions (i.e. functions differentiable on $U$ in the complex
sense). The family $\{g_k\}$ is said to be normal on $U$ if every sequence of functions
selected from $\{g_k\}$ has a subsequence which converges uniformly on every
compact subset of $U$, either to a bounded analytic function or to $\infty$. Notice that
by standard complex variable theory, this means that the subsequence converges
either to a finite analytic function or to $\infty$ on each connected component of
$U$. In the former case, the derivatives of the subsequence must converge to the
derivative of the limit function. The family $\{g_k\}$ is normal at the point $w$ of $U$ if
there is some open subset $V$ of $U$ containing $w$ such that $\{g_k\}$ is a normal family
on $V$. Observe that this is equivalent to there being a neighbourhood $V$ of $w$ on
which every sequence $\{g_k\}$ has a subsequence convergent to a bounded analytic
function or to $\infty$.

The key result which we will use repeatedly in our development of Julia sets
is the remarkable theorem of Montel, which asserts that non-normal families of
functions take virtually all complex values.

Montel's theorem 14.5

Let $\{g_k\}$ be a family of complex analytic functions on an open domain $U$. If $\{g_k\}$
is not a normal family, then for all $w \in \mathbb{C}$ with at most one exception we have
$g_k(z) = w$ for some $z \in U$ and some $k$.

Proof. Consult the literature on complex function theory. □

Montel's theorem quickly leads to the following characterization of a Julia set, 
which is often taken to be the definition in more advanced work, since it is more 
widely applicable and lends itself to complex variable techniques. 

Proposition 14.6 

$$J(f) = \{z \in \mathbb{C} : \text{the family } \{f^k\} \text{ is not normal at } z\}. \tag{14.1}$$

Proof. If $z \in J$, then in every neighbourhood $V$ of $z$ there are points $w$ such that $f^k(w) \to \infty$, whilst $f^k(z)$ remains bounded. Thus no subsequence of $\{f^k\}$ is uniformly convergent on $V$, so that $\{f^k\}$ is not normal at $z$.

Suppose that $z \notin J$. Either $z \in \operatorname{int} K$, in which case, taking an open set $V$ with $z \in V \subset \operatorname{int} K$, we have $f^k(w) \in K$ for all $w \in V$ and all $k$, so by Montel's theorem 14.5 $\{f^k\}$ is normal at $w$. Otherwise, $z \in \mathbb{C} \setminus K$, so $|f^k(z)| > r$ for some $k$, where $r$ is given by Lemma 14.1, so $|f^k(w)| > r$ for all $w$ in some neighbourhood $V$ of $z$, so by Lemma 14.1, $f^k(w) \to \infty$ uniformly on $V$, so again $\{f^k\}$ is normal at $w$. □

The expression (14.1) is used as the definition of the Julia set for general 
complex functions, such as rational functions or meromorphic functions. Taking 
this as the definition, the theory developed from Montel's theorem extends to a 
wide class of complex functions. Note, however, that if $f : \mathbb{C} \cup \{\infty\} \to \mathbb{C} \cup \{\infty\}$ is a rational function, then $J$ must be closed, but need not be bounded. Indeed, it is possible for $J$ to be the whole complex plane; for example, if $f(z) = ((z - 2)/z)^2$.

A main aim now is to obtain a further characterization of the Julia set $J(f)$ as the closure of the repelling periodic points of $f$. On the way we encounter further interesting properties of Julia sets; for example, our next result tells us that $f$ is 'mixing' near $J(f)$, that is, neighbourhoods of points of $J(f)$ are spread right across $\mathbb{C}$ by iterates of $f$.

Lemma 14.7 

Let $f$ be a polynomial, let $w \in J(f)$ and let $U$ be any neighbourhood of $w$. Then for each $j = 1, 2, \ldots$, the set $W \equiv \bigcup_{k=j}^{\infty} f^k(U)$ is the whole of $\mathbb{C}$, except possibly for a single point. Any such exceptional point is not in $J(f)$, and is independent of $w$ and $U$.

Proof. By Proposition 14.6, the family $\{f^k\}_{k=j}^{\infty}$ is not normal at $w$, so the first part is immediate from Montel's theorem 14.5.

Suppose $v \notin W$. If $f(z) = v$, then, since $f(W) \subset W$, it follows that $z \notin W$. As $\mathbb{C} \setminus W$ consists of at most one point, then $z = v$. Hence $f$ is a polynomial of degree $n$ such that the only solution of $f(z) - v = 0$ is $v$, which implies that $f(z) - v = c(z - v)^n$ for some constant $c$.

If $z$ is sufficiently close to $v$, then $f^k(z) - v \to 0$ as $k \to \infty$, convergence being uniform on, say, $\{z : |z - v| < (2c)^{-1/(n-1)}\}$. Thus $\{f^k\}$ is normal at $v$, so the exceptional point $v \notin J(f)$. Clearly $v$ only depends on the polynomial $f$. (In fact, if $W$ omits a point $v$ of $\mathbb{C}$, then $J(f)$ is the circle with centre $v$ and radius $c^{-1/(n-1)}$.) □

The following corollary is the basis of many computer pictures of Julia sets; 
see Section 14.3. 



Corollary 14.8 

(a) The following holds for all $z \in \mathbb{C}$ with at most one exception: if $U$ is an open set intersecting $J(f)$ then $f^{-k}(z)$ intersects $U$ for infinitely many values of $k$.

(b) If $z \in J(f)$ then $J(f)$ is the closure of $\bigcup_{k=1}^{\infty} f^{-k}(z)$.

Proof.

(a) Provided $z$ is not the exceptional point of Lemma 14.7, $z \in f^k(U)$, and thus $f^{-k}(z)$ intersects $U$, for infinitely many $k$.

(b) If $z \in J(f)$ then $f^{-k}(z) \subset J(f)$, by Proposition 14.3, so that $\bigcup_{k=1}^{\infty} f^{-k}(z)$ and, therefore, its closure is contained in the closed set $J(f)$. On the other hand, if $U$ is an open set containing $z \in J(f)$, then $f^{-k}(z)$ intersects $U$ for some $k$, by part (a); $z$ cannot be the exceptional point by Lemma 14.7, so $z$ is in the closure of $\bigcup_{k=1}^{\infty} f^{-k}(z)$. □

Proposition 14.9 

$J(f)$ is a perfect set (i.e. closed and with no isolated points) and is therefore uncountable.

Proof. Let $v \in J(f)$ and let $U$ be a neighbourhood of $v$. We must show that $U$ contains other points of $J(f)$. We consider three cases separately.

(i) $v$ is not a fixed or periodic point of $f$. By Corollary 14.8(b) and Proposition 14.3, $U$ contains a point of $f^{-k}(v) \subset J(f)$ for some $k \geq 1$, and this point must be different from $v$.

(ii) $f(v) = v$. If $f(z) = v$ has no solution other than $v$, then, just as in the proof of Lemma 14.7, $v \notin J(f)$. Thus there exists $w \neq v$ with $f(w) = v$. By Corollary 14.8(b), $U$ contains a point $u$ of $f^{-k}(w) \subset f^{-k-1}(v)$ for some $k \geq 1$. Any such $u$ is in $J(f)$ by backward invariance and is distinct from $v$, since $f^k(v) = v \neq w = f^k(u)$.

(iii) $f^p(v) = v$ for some $p > 1$. By Proposition 14.4, $J(f) = J(f^p)$, so by applying (ii) to $f^p$ we see that $U$ contains points of $J(f^p) = J(f)$ other than $v$.

Thus $J(f)$ has no isolated points; since it is closed, it is perfect. Finally, every perfect set is uncountable, see Exercise 14.1. □

We can now prove the main result of this section, that the Julia set $J(f)$ is the closure of the repelling periodic points of $f$.

Theorem 14.10

If $f$ is a polynomial, $J(f)$ is the closure of the repelling periodic points of $f$.

Proof. Let $w$ be a repelling periodic point of $f$ of period $p$, so $w$ is a repelling fixed point of $g = f^p$. Suppose that $\{g^k\}$ is normal at $w$; then $w$ has an open neighbourhood $V$ on which a subsequence $\{g^{k_i}\}$ converges to a finite analytic function $g_0$ (it cannot converge to $\infty$ since $g^k(w) = w$ for all $k$). By a standard result from complex analysis, the derivatives also converge, $(g^{k_i})'(z) \to g_0'(z)$ if $z \in V$. However, by the chain rule, $|(g^{k_i})'(w)| = |(g'(w))^{k_i}| \to \infty$ since $w$ is a repelling fixed point and $|g'(w)| > 1$. This contradicts the finiteness of $g_0'(w)$, so $\{g^k\}$ cannot be normal at $w$. Thus $w \in J(g) = J(f^p) = J(f)$, by Proposition 14.4. Since $J(f)$ is closed, it follows that the closure of the repelling periodic points is in $J(f)$.

Let $E = \{w \in J(f) \text{ such that there exists } v \neq w \text{ with } f(v) = w \text{ and } f'(v) \neq 0\}$. Suppose that $w \in E$. Then there is an open neighbourhood $V$ of $w$ on which we may find a local analytic inverse $f^{-1} : V \to \mathbb{C} \setminus V$ so that $f^{-1}(w) = v \neq w$ (just choose values of $f^{-1}(z)$ in a continuous manner). Define a family of analytic functions $\{h_k\}$ on $V$ by

$$h_k(z) = \frac{f^k(z) - z}{f^{-1}(z) - z}.$$

Let $U$ be any open neighbourhood of $w$ with $U \subset V$. Since $w \in J(f)$ the family $\{f^k\}$ and thus, from the definition, the family $\{h_k\}$ is not normal on $U$. By Montel's theorem 14.5, $h_k(z)$ must take either the value 0 or 1 for some $k$ and $z \in U$. In the first case $f^k(z) = z$ for some $z \in U$; in the second case $f^k(z) = f^{-1}(z)$ so $f^{k+1}(z) = z$ for some $z \in U$. Thus $U$ contains a periodic point of $f$, so $w$ is in the closure of the repelling periodic points for all $w \in E$.

Since $f$ is a polynomial, $E$ contains all of $J(f)$ except for a finite number of points. Since $J(f)$ contains no isolated points, by Proposition 14.9, $J(f) \subset \overline{E}$ is a subset of the closure of the repelling periodic points. □

If $w$ is an attractive fixed point of $f$, we write

$$A(w) = \{z \in \mathbb{C} : f^k(z) \to w \text{ as } k \to \infty\} \tag{14.2}$$

for the basin of attraction of $w$. We define the basin of attraction of infinity, $A(\infty)$, in the same way. Since $w$ is attractive, there is an open set $V$ containing $w$ in $A(w)$ (if $w = \infty$, we may take $\{z : |z| > r\}$, for sufficiently large $r$). This implies that $A(w)$ is open, since if $f^k(z) \in V$ for some $k$, then $z \in f^{-k}(V)$, which is open. The following characterization of $J$ as the boundary of any basin of attraction is extremely useful in determining Julia sets. Recall the notation $\partial A$ for the boundary of the set $A$.

Lemma 14.11 

Let $w$ be an attractive fixed point of $f$. Then $\partial A(w) = J(f)$. The same is true if $w = \infty$.

Proof. If $z \in J(f)$ then $f^k(z) \in J(f)$ for all $k$ so cannot converge to an attractive fixed point, and $z \notin A(w)$. However, if $U$ is any neighbourhood of $z$, the set $f^k(U)$ contains points of $A(w)$ for some $k$ by Lemma 14.7, so there are points arbitrarily close to $z$ that iterate to $w$. Thus $z \in \overline{A(w)}$ and so $z \in \partial A(w)$.

Suppose $z \in \partial A(w)$ but $z \notin J(f)$. Then $z$ has a connected open neighbourhood $V$ on which $\{f^k\}$ has a subsequence convergent either to an analytic function or to $\infty$. The subsequence converges to $w$ on $V \cap A(w)$, which is open and non-empty, and therefore on $V$, since an analytic function is constant on a connected set if it is constant on any open subset. All points of $V$ are mapped into $A(w)$ by iterates of $f$, so $V \subset A(w)$, contradicting that $z \in \partial A(w)$. □

For an example of this lemma, recall the case $f(z) = z^2$. The Julia set is the unit circle, which is the boundary of both $A(0)$ and $A(\infty)$.

We collect together the main points of this section.

Summary 14.12 

The Julia set $J(f)$ of the polynomial $f$ is the boundary of the set of points $z \in \mathbb{C}$ such that $f^k(z) \to \infty$. It is an uncountable non-empty compact set containing no isolated points and is invariant under $f$ and $f^{-1}$, and $J(f) = J(f^p)$ for each positive integer $p$. If $z \in J(f)$, then $J(f)$ is the closure of $\bigcup_{k=1}^{\infty} f^{-k}(z)$. The Julia set is the boundary of the basin of attraction of each attractive fixed point of $f$, including $\infty$, and is the closure of the repelling periodic points of $f$.

Proof. This collects together the results of this section. □ 

It is possible to discover a great deal more about the dynamics of $f$ on the Julia set. It may be shown that '$f$ acts chaotically on $J$' (see Chapter 13). Periodic points of $f$ are dense in $J$ but, on the other hand, $J$ contains points $z$ with iterates $f^k(z)$ that are dense in $J$. Moreover, $f$ has 'sensitive dependence on initial conditions' on $J$; thus $|f^k(z) - f^k(w)|$ will be large for certain $k$ regardless of how close $z, w \in J$ are, making accurate computation of iterates impossible.
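
Sensitive dependence is easy to observe numerically in the simplest case $f(z) = z^2$, whose Julia set is the unit circle. The following is a minimal sketch, not part of the original text: the function names `iterate_square` and `separation`, and the choice of starting angle and perturbation, are illustrative assumptions.

```python
import cmath


def iterate_square(z, k):
    """Return f^k(z) for f(z) = z**2 (k-fold repeated squaring)."""
    for _ in range(k):
        z = z * z
    return z


def separation(theta, delta, k):
    """Distance |f^k(z) - f^k(w)| for two points on the unit circle.

    z = exp(i*theta) and w = exp(i*(theta + delta)) are both in J(f).
    Squaring doubles the argument, so the angular gap grows like 2**k * delta.
    """
    z = cmath.exp(1j * theta)
    w = cmath.exp(1j * (theta + delta))
    return abs(iterate_square(z, k) - iterate_square(w, k))
```

For example, comparing `separation(1.0, 1e-6, 0)` with `separation(1.0, 1e-6, 20)` shows two points that start about $10^{-6}$ apart ending up at a distance of order 1 after twenty iterations.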

14.2 Quadratic functions: the Mandelbrot set

We now specialize to the case of quadratic functions on $\mathbb{C}$. We study Julia sets of polynomials of the form

$$f_c(z) = z^2 + c. \tag{14.3}$$

This is not as restrictive as it first appears: if $h(z) = \alpha z + \beta$ $(\alpha \neq 0)$ then

$$h^{-1}(f_c(h(z))) = (\alpha^2 z^2 + 2\alpha\beta z + \beta^2 + c - \beta)/\alpha.$$

By choosing appropriate values of $\alpha$, $\beta$ and $c$ we can make this expression into any quadratic function $f$ that we please. Then $h^{-1} \circ f_c \circ h = f$, so $h^{-1} \circ f_c^k \circ h = f^k$ for all $k$. This means that the sequence of iterates $\{f^k(z)\}$ of a point $z$ under $f$ is just the image under $h^{-1}$ of the sequence of iterates $\{f_c^k(h(z))\}$ of the point $h(z)$ under $f_c$. The mapping $h$ transforms the dynamical picture of $f$ to that of $f_c$. In particular, $f^k(z) \to \infty$ if and only if $f_c^k(h(z)) \to \infty$; thus the Julia set of $f$ is the image under $h^{-1}$ of the Julia set of $f_c$.

The transformation $h$ is called a conjugacy between $f$ and $f_c$. Any quadratic function is conjugate to $f_c$ for some $c$, so by studying the Julia sets of $f_c$ for $c \in \mathbb{C}$ we effectively study the Julia sets of all quadratic polynomials. Since $h$ is a similarity transformation, the Julia set of any quadratic polynomial is geometrically similar to that of $f_c$ for some $c \in \mathbb{C}$.
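
The choice of $\alpha$, $\beta$ and $c$ can be made explicit by matching coefficients in the expression for $h^{-1}(f_c(h(z)))$. The sketch below is an illustration rather than part of the original text: it assumes the quadratic is written as $f(z) = az^2 + bz + d$ with $a \neq 0$ (the names `a`, `b`, `d` and `conjugate_to_fc` are hypothetical), which gives $\alpha = a$, $\beta = b/2$ and $c = ad + \beta - \beta^2$.

```python
def conjugate_to_fc(a, b, d):
    """Conjugate f(z) = a*z**2 + b*z + d (a != 0) to f_c(z) = z**2 + c.

    Returns (alpha, beta, c) such that, with h(z) = alpha*z + beta,
    h^{-1}(f_c(h(z))) = f(z). The values come from matching
    (alpha**2 z**2 + 2*alpha*beta*z + beta**2 + c - beta) / alpha
    against a*z**2 + b*z + d coefficient by coefficient.
    """
    if a == 0:
        raise ValueError("f must be a genuine quadratic (a != 0)")
    alpha = a
    beta = b / 2
    c = a * d + beta - beta * beta
    return alpha, beta, c


def conjugated_value(z, alpha, beta, c):
    """Evaluate h^{-1}(f_c(h(z))) directly, for checking the conjugacy."""
    hz = alpha * z + beta
    return (hz * hz + c - beta) / alpha
```

For instance, `conjugate_to_fc(2, 3, 1)` returns `c = 1.25`, and `conjugated_value(z, 2, 1.5, 1.25)` agrees with `2*z**2 + 3*z + 1` up to rounding error.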

It should be borne in mind throughout this section that $f_c^{-1}(z)$ takes two distinct values $\pm(z - c)^{1/2}$, called the two branches of $f_c^{-1}(z)$, except when $z = c$. Thus if $U$ is a small open set with $c \notin U$, then the pre-image $f_c^{-1}(U)$ has two parts, both of which are mapped bijectively and smoothly by $f_c$ onto $U$.

We define the Mandelbrot set $M$ to be the set of parameters $c$ for which the Julia set of $f_c$ is connected

$$M = \{c \in \mathbb{C} : J(f_c) \text{ is connected}\}. \tag{14.4}$$

At first, $M$ appears to relate to one rather specific property of $J(f_c)$. In fact, as we shall see, $M$ contains an enormous amount of information about the structure of Julia sets.

The definition (14.4) is awkward for computational purposes. We derive an equivalent definition that is much more useful for determining whether a parameter $c$ lies in $M$ and for investigating the extraordinarily intricate form of $M$; see figure 14.2: we show that $c \in M$ if and only if $f_c^k(0) \not\to \infty$.

To do this, we first need to know a little about the effect of the transformation
$f_c$ on smooth curves. For brevity, we term a smooth (i.e. differentiable), closed,
simple (i.e. non-self-intersecting) curve in the complex plane a loop. We refer 
to the parts of C inside and outside such a curve as the interior and exterior 
of the loop. A figure of eight is a smooth closed curve with a single point of 
self-intersection. 

Figure 14.2 The Mandelbrot set $M$ in the complex plane

Lemma 14.13 

Let C be a loop in the complex plane. 

(a) If $c$ is inside $C$ then $f_c^{-1}(C)$ is a loop, with the inverse image of the interior of $C$ as the interior of $f_c^{-1}(C)$.

(b) If $c$ lies on $C$ then $f_c^{-1}(C)$ is a figure of eight with self-intersection at 0, such that the inverse image of the interior of $C$ is the interior of the two loops.

(c) If $c$ is outside $C$, then $f_c^{-1}(C)$ comprises two disjoint loops, with the inverse image of the interior of $C$ the interior of the two loops.

Proof. Note that $f_c^{-1}(z) = \pm(z - c)^{1/2}$ and $(f_c^{-1})'(z) = \pm\tfrac{1}{2}(z - c)^{-1/2}$, which is finite and non-zero if $z \neq c$. Hence, if we select one of the two branches of $f_c^{-1}$, the set $f_c^{-1}(C)$ is locally a smooth curve, provided $c \notin C$.

(a) Suppose $c$ is inside $C$. Take an initial point $w$ on $C$ and choose one of the two values for $f_c^{-1}(w)$. Allowing $f_c^{-1}(z)$ to vary continuously as $z$ moves around $C$, the point $f_c^{-1}(z)$ traces out a smooth curve. When $z$ returns to $w$, however, $f_c^{-1}(w)$ takes its second value. As $z$ traverses $C$ again, $f_c^{-1}(z)$ continues on its smooth path, which closes as $z$ returns to $w$ the second time. Since $c \notin C$, we have $0 \notin f_c^{-1}(C)$, so $f_c'(z) \neq 0$ on $f_c^{-1}(C)$. Thus $f_c$ is locally a smooth bijective transformation near points on $f_c^{-1}(C)$. In particular $z \in f_c^{-1}(C)$ cannot be a point of self-intersection of $f_c^{-1}(C)$, otherwise $f_c(z)$ would be at a self-intersection of $C$, so $f_c^{-1}(C)$ is a loop. Since $f_c$ is a continuous function that maps the loop $f_c^{-1}(C)$ and no other points onto the loop $C$, the polynomial $f_c$ must map the interior and exterior of $f_c^{-1}(C)$ into the interior and exterior of $C$ respectively. Hence $f_c^{-1}$ maps the interior of $C$ to the interior of $f_c^{-1}(C)$.

(b) This is proved in a similar way to (a), noting that if $C_0$ is a smooth piece of curve through $c$, then $f_c^{-1}(C_0)$ consists of two smooth pieces of curve through 0 which cross at right angles, providing the self-intersection of the figure of eight.

(c) This is similar to (a), noting that this time, $f_c^{-1}(z)$ can only pick up one of the two values as $z$ moves around $C$, so we get two loops. □

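We now prove the 'fundamental theorem of the Mandelbrot set' which gives an alternative characterization in terms of iterates of $f_c$.

Theorem 14.14

$$M = \{c \in \mathbb{C} : \{f_c^k(0)\}_{k \geq 1} \text{ is bounded}\} \tag{14.5}$$

$$\phantom{M} = \{c \in \mathbb{C} : f_c^k(0) \not\to \infty \text{ as } k \to \infty\}. \tag{14.6}$$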



Proof. By Lemma 14.1 it is clear that $f_c^k(0) \not\to \infty$ if and only if $\{f_c^k(0)\}$ is bounded, so (14.5) and (14.6) are equal.

(a) We first show that if $\{f_c^k(0)\}$ is bounded then $J(f_c)$ is connected. Let $C$ be a large circle in $\mathbb{C}$ such that all the points $\{f_c^k(0)\}$ lie inside $C$, such that $f_c^{-1}(C)$ is interior to $C$ and such that points outside $C$ iterate to $\infty$ under $f_c^k$. Since $c = f_c(0)$ is inside $C$, Lemma 14.13(a) gives that $f_c^{-1}(C)$ is a loop contained in the interior of $C$. Also, $f_c(c) = f_c^2(0)$ is inside $C$ and $f_c^{-1}$ maps the exterior of $C$ onto the exterior of $f_c^{-1}(C)$, so $c$ is inside $f_c^{-1}(C)$. Applying Lemma 14.13(a) again, $f_c^{-2}(C)$ is a loop contained in the interior of $f_c^{-1}(C)$. Proceeding in this way, we see that $\{f_c^{-k}(C)\}$ consists of a sequence of loops, each containing the next in its interior (figure 14.3(a)). Let $K$ denote the closed set of points that are on or inside the loops $f_c^{-k}(C)$ for all $k$. If $z \in \mathbb{C} \setminus K$ some iterate $f_c^k(z)$ lies outside $C$ and so $f_c^k(z) \to \infty$. Thus

$$A(\infty) = \{z : f_c^k(z) \to \infty \text{ as } k \to \infty\} = \mathbb{C} \setminus K$$

so $K$ is the filled-in Julia set of $f_c$. By Lemma 14.11 $J(f_c)$ is the boundary of $\mathbb{C} \setminus K$ which is, of course, the same as the boundary of $K$. But $K$ is the intersection of a decreasing sequence of closed simply connected sets (i.e. sets that are connected and have connected complement), so, by a simple topological argument, $K$ is simply connected and therefore has connected boundary. Thus $J(f_c)$ is connected.

Figure 14.3 Inverse iterates of a circle $C$ under $f_c$, illustrating the two parts of the proof of Theorem 14.14: (a) $c = -0.3 + 0.3i$; (b) $c = -0.9 + 0.5i$

(b) The proof that $J(f_c)$ is not connected if $\{f_c^k(0)\}$ is unbounded is fairly similar. Let $C$ be a large circle such that $f_c^{-1}(C)$ is inside $C$, such that all points outside $C$ iterate to $\infty$, and such that for some $p$, the point $f_c^{p-1}(c) = f_c^p(0) \in C$ with $f_c^k(0)$ inside or outside $C$ according as to whether $k$ is less than or greater than $p$. Just as in the first part of the proof, we construct a series of loops $\{f_c^{-k}(C)\}$, each containing the next in its interior (figure 14.3(b)). However, the argument breaks down when we get to the loop $f_c^{1-p}(C)$, since $c \in f_c^{1-p}(C)$ and Lemma 14.13(a) does not apply. By Lemma 14.13(b) we get that $E \equiv f_c^{-p}(C)$ is a figure of eight inside the loop $f_c^{1-p}(C)$, with $f_c$ mapping the interior of each half of $E$ onto the interior of $f_c^{1-p}(C)$. The Julia set $J(f_c)$ must lie in the interior of the loops of $E$, since other points iterate to infinity. Since $J(f_c)$ is invariant under $f_c^{-1}$, parts of it must be contained in each of the loops of $E$. Thus this figure of eight $E$ disconnects $J(f_c)$. In fact, if we now apply Lemma 14.13(c) in the same way, it is easy to see that $J(f_c)$ is totally disconnected. □

The reason for considering iterates of the origin in (14.5) and (14.6) is that the origin is the critical point of $f_c$ for each $c$, i.e. the point for which $f_c'(z) = 0$. The critical points are the points where $f_c$ fails to be a local bijection, a property that was crucial in distinguishing the two cases in the proof of Theorem 14.14 and which is crucial in analysing the dynamics of any polynomial or meromorphic function.

The equivalent definition of $M$ provided by (14.5) is the basis of computer pictures of the Mandelbrot set. Choose numbers $r > 2$ and $k_0$ of the order of 100, say. For each $c$ compute successive terms of the sequence $\{f_c^k(0)\}$ until either $|f_c^k(0)| \geq r$, in which case $c$ is deemed to be outside $M$, noting Exercise 14.12, or $k = k_0$, in which case we take $c \in M$. Repeating this process for values of $c$ across a region enables a picture of $M$ to be drawn. Often colours are assigned to the complement of $M$ according to the first integer $k$ such that $|f_c^k(0)| \geq r$.
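
The procedure just described can be sketched in a few lines. This is an illustrative sketch only: the function names `escape_time` and `render_ascii`, the default values $r = 2.5$ and $k_0 = 100$, and the plotted region are assumptions chosen for the example, not values fixed by the text.

```python
def escape_time(c, r=2.5, k0=100):
    """First k >= 1 with |f_c^k(0)| >= r, or None if there is none for k <= k0.

    A return value of None means c is taken to lie in M; an integer is the
    iteration count that is often used to colour the complement of M.
    """
    z = 0j
    for k in range(1, k0 + 1):
        z = z * z + c          # one application of f_c
        if abs(z) >= r:
            return k
    return None


def render_ascii(width=64, height=24, re_range=(-2.2, 0.8), im_range=(-1.2, 1.2)):
    """Return a list of strings: '#' where c is taken to be in M, '.' elsewhere."""
    rows = []
    for j in range(height):
        im = im_range[1] - (im_range[1] - im_range[0]) * j / (height - 1)
        row = ""
        for i in range(width):
            re = re_range[0] + (re_range[1] - re_range[0]) * i / (width - 1)
            row += "#" if escape_time(complex(re, im)) is None else "."
        rows.append(row)
    return rows
```

Calling `print("\n".join(render_ascii()))` gives a coarse character picture of $M$. Points close to the boundary of $M$ may need a larger $k_0$ to be classified reliably.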

Pictures of the Mandelbrot set (see figure 14.2) suggest that it has a highly 
complicated form. It has certain obvious features: a main cardioid to which a 
series of prominent circular 'buds' are attached. Each of these buds is surrounded 
by further buds, and so on. However, this is not all. In addition, fine, branched 
'hairs' grow outwards from the buds, and these hairs carry miniature copies of 
the entire Mandelbrot set along their length. The Mandelbrot set is connected, 
and its boundary has Hausdorff dimension 2, a reflection on its intricacy. 



14.3 Julia sets of quadratic functions 

In this section we will see a little of how the structure of the Julia set $J(f_c)$ changes as the parameter $c$ varies across the complex plane. In particular, the significance of the various parts of the Mandelbrot set will start to become apparent.

The attractive periodic points of $f_c$ are crucial to the form of $J(f_c)$. It may be shown (see Exercise 14.17) that if $w \neq \infty$ is an attractive periodic point of a polynomial $f$ then there is a critical point $z$ (a point with $f'(z) = 0$) such that $f^k(z)$ is attracted to the periodic orbit containing $w$. Since the only critical point of $f_c$ is 0, $f_c$ can have at most one attractive periodic orbit. Moreover, if $c \notin M$ then, by Theorem 14.14, $f_c^k(0) \to \infty$, so $f_c$ can have no attractive periodic orbit. It is conjectured that the set of $c$ for which $f_c$ has an attractive periodic orbit fills the interior of $M$.

It is natural to categorize $f_c$ by the period $p$ of the (finite) attractive orbit, if any; the values of $c$ corresponding to different $p$ may be identified as different regions of the Mandelbrot set $M$.

To begin with, suppose $c$ lies outside $M$, so $f_c$ has no attractive periodic points. By definition, $J(f_c)$ is not connected. In fact, $J(f_c)$ must be totally disconnected and expressible as the disjoint union $J = S_1(J) \cup S_2(J)$, where $S_1$ and $S_2$ are the two branches of $f_c^{-1}$ on $J$. This means that $J$ is the attractor of the IFS $\{S_1, S_2\}$, see (9.2). Basically, this follows from the second half of the proof of Theorem 14.14: we get that $f_c$ maps the interior of each loop of a figure of eight $E$ onto a region $D$ containing $E$. The mappings $S_1$ and $S_2$ may be taken as the restrictions of $f_c^{-1}$ to the interior of each loop. Since $S_1(J)$ and $S_2(J)$ are interior to the two halves of $E$, they are disjoint, so $J$ must be totally disconnected; see remarks after (9.7).

We look at this situation in more detail when c is large enough to allow some 
simplifications to be made. 



Theorem 14.15 

Suppose $|c| > \tfrac{1}{4}(5 + 2\sqrt{6}) = 2.475\ldots$. Then $J(f_c)$ is totally disconnected, and is the attractor (in the sense of (9.2)) of the contractions given by the two branches of $f_c^{-1}(z) = \pm(z - c)^{1/2}$ for $z$ near $J$. When $|c|$ is large

$$\dim_B J(f_c) = \dim_H J(f_c) \simeq 2 \log 2 / \log 4|c|.$$

Proof. Let $C$ be the circle $|z| = |c|$ and $D$ its interior $|z| < |c|$. Then

$$f_c^{-1}(C) = \{(c e^{i\theta} - c)^{1/2} : 0 \leq \theta \leq 4\pi\}$$

which is a figure of eight with self-intersection point at 0, with the loops on either side of a straight line through the origin (see Lemma 14.13 and figure 14.4). Since $|c| > 2$ we have $f_c^{-1}(C) \subset D$, since if $|z| > |c|$ then $|f_c(z)| \geq |z^2| - |c| > |c|^2 - |c| > |c|$. The interior of each of the loops of $f_c^{-1}(C)$ is mapped by $f_c$ in a bijective manner onto $D$. If we define $S_1, S_2 : D \to D$ as the branches of $f_c^{-1}(z)$ inside each loop, then $S_1(D)$ and $S_2(D)$ are the interiors of the two loops.

Figure 14.4 Proof of Theorem 14.15

Let $V$ be the disc $\{z : |z| < |2c|^{1/2}\}$. We have chosen the radius of $V$ so that $V$ just contains $f_c^{-1}(C)$, so $S_1(D), S_2(D) \subset V \subset D$. Hence $S_1(V), S_2(V) \subset V$ with $S_1(V)$ and $S_2(V)$ disjoint. We have, for $i = 1, 2$,

$$|S_i(z_1) - S_i(z_2)| = |(z_1 - c)^{1/2} - (z_2 - c)^{1/2}| = \frac{|z_1 - z_2|}{|(z_1 - c)^{1/2} + (z_2 - c)^{1/2}|}.$$

Hence, if $z_1, z_2 \in V$, taking least and greatest values,

$$\tfrac{1}{2}(|c| + |2c|^{1/2})^{-1/2} \leq \frac{|S_i(z_1) - S_i(z_2)|}{|z_1 - z_2|} \leq \tfrac{1}{2}(|c| - |2c|^{1/2})^{-1/2}. \tag{14.7}$$



The upper bound is less than 1 if $|c| > \tfrac{1}{4}(5 + 2\sqrt{6})$, in which case $S_1$ and $S_2$ are contractions on the disc $V$. By Theorem 9.1 there is a unique non-empty compact attractor $F \subset V$ satisfying

$$S_1(F) \cup S_2(F) = F. \tag{14.8}$$

Since $S_1(V)$ and $S_2(V)$ are disjoint, so are $S_1(F)$ and $S_2(F)$, implying that $F$ is totally disconnected.

Of course, $F$ is none other than the Julia set $J = J(f_c)$. One way to see this is to note that, since $V$ contains at least one point $z$ of $J$ (for example, a repelling fixed point of $f_c$), we have $J = \operatorname{closure}\left(\bigcup_{k=1}^{\infty} f_c^{-k}(z)\right) \subset V$, since $f_c^{-k}(V) \subset V$. Using further results from Summary 14.12, $J$ is a non-empty compact subset of $V$ satisfying $J = f_c^{-1}(J)$ or, equivalently, $J = S_1(J) \cup S_2(J)$. Thus $J = F$, the unique non-empty compact set satisfying (14.8).

Finally, we estimate the dimension of $J(f_c) = F$. By Propositions 9.6 and 9.7 and (14.7), lower and upper bounds for $\dim_H J(f_c)$ are provided by the solutions of $2\left(\tfrac{1}{2}(|c| \pm |2c|^{1/2})^{-1/2}\right)^s = 1$, i.e. by $s = 2 \log 2 / \log 4(|c| \pm |2c|^{1/2})$, which gives the stated asymptotic estimate. □
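
To get a feel for how tight these bounds are, they can be evaluated numerically. The sketch below simply codes the two formulae from the proof and the asymptotic estimate from the theorem; the names `C_MIN`, `dimension_bounds` and `asymptotic_dimension` are illustrative assumptions.

```python
import math

# Threshold from Theorem 14.15: (5 + 2*sqrt(6)) / 4 = 2.4747...
C_MIN = (5 + 2 * math.sqrt(6)) / 4


def dimension_bounds(abs_c):
    """Lower and upper bounds s = 2 log 2 / log 4(|c| +/- |2c|^(1/2)).

    Only valid for |c| > C_MIN, where both branches of the inverse are
    contractions on the disc V; at |c| = C_MIN the upper bound degenerates.
    """
    if abs_c <= C_MIN:
        raise ValueError("the estimate requires |c| > (5 + 2*sqrt(6))/4")
    root = math.sqrt(2 * abs_c)
    lower = 2 * math.log(2) / math.log(4 * (abs_c + root))
    upper = 2 * math.log(2) / math.log(4 * (abs_c - root))
    return lower, upper


def asymptotic_dimension(abs_c):
    """The large-|c| estimate 2 log 2 / log 4|c|."""
    return 2 * math.log(2) / math.log(4 * abs_c)
```

For $|c| = 100$ the two bounds are roughly 0.226 and 0.237, bracketing the asymptotic value of about 0.231. Close to the threshold the upper bound is too weak to be informative.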

We next turn to the case where $c$ is small. We know that if $c = 0$ then $J(f_c)$ is the unit circle. If $c$ is small and $z$ is small enough, then $f_c^k(z) \to w$ as $k \to \infty$, where $w$ is the attractive fixed point $\tfrac{1}{2}(1 - \sqrt{1 - 4c})$ close to 0. On the other hand, $f_c^k(z) \to \infty$ if $z$ is large. It is not unreasonable to expect the circle to 'distort' into a simple closed curve (i.e. having no points of self-intersection) separating these two types of behaviour as $c$ moves away from 0.

In fact, this is the case provided that $f_c$ retains an attractive fixed point, i.e. if $|f_c'(z)| < 1$ at one of the roots of $f_c(z) = z$. Simple algebra shows that this happens if $c$ lies inside the cardioid $z = \tfrac{1}{2} e^{i\theta}(1 - \tfrac{1}{2} e^{i\theta})$ $(0 \leq \theta \leq 2\pi)$; this is the main cardioid of the Mandelbrot set, see Exercise 14.15.
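
The 'simple algebra' can be checked numerically: the fixed points are the roots of $z^2 - z + c = 0$, and a fixed point $w$ is attractive when $|f_c'(w)| = |2w| < 1$. Writing $\mu = 2w$ gives $c = \tfrac{1}{2}\mu(1 - \tfrac{1}{2}\mu)$, so $|\mu| = 1$ traces out the cardioid. The following sketch uses hypothetical helper names and is an illustration, not a solution to Exercise 14.15.

```python
import cmath


def fixed_point_multipliers(c):
    """Return [(w, f_c'(w))] for the two roots w of z**2 - z + c = 0."""
    disc = cmath.sqrt(1 - 4 * c)
    roots = [(1 - disc) / 2, (1 + disc) / 2]
    return [(w, 2 * w) for w in roots]   # f_c'(w) = 2w


def has_attractive_fixed_point(c):
    """True if some fixed point w of f_c has |f_c'(w)| < 1."""
    return any(abs(m) < 1 for _, m in fixed_point_multipliers(c))


def c_from_multiplier(mu):
    """Parameter c whose fixed point w = mu/2 has multiplier mu.

    Taking mu = exp(i*theta) gives the cardioid of the text.
    """
    return (mu / 2) * (1 - mu / 2)
```

For example, `has_attractive_fixed_point(c_from_multiplier(0.9 * cmath.exp(1j)))` is `True`, while the same test with `1.1` in place of `0.9` gives `False`.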

For convenience, we treat the case of $|c| < \tfrac{1}{4}$, but the proof is easily modified if $f_c$ has any attractive fixed point.

Theorem 14.16

If $|c| < \tfrac{1}{4}$ then $J(f_c)$ is a simple closed curve.

Proof. Let $C_0$ be the curve $|z| = \tfrac{1}{2}$, which encloses both $c$ and the attractive fixed point $w$ of $f_c$. Then by direct calculation the inverse image $f_c^{-1}(C_0)$ is a loop $C_1$ surrounding $C_0$. We may fill the annular region $A_1$ between $C_0$ and $C_1$ by a continuum of curves, which we call 'trajectories', which leave $C_0$ and reach $C_1$ perpendicularly; see figure 14.5(a). For each $\theta$ let $\psi_1(\theta)$ be the point on $C_1$ at the end of the trajectory leaving $C_0$ at $\psi_0(\theta) = \tfrac{1}{2} e^{i\theta}$. The inverse image $f_c^{-1}(A_1)$ is an annular region $A_2$ with outer boundary the loop $C_2 = f_c^{-1}(C_1)$ and inner boundary $C_1$, with $f_c$ mapping $A_2$ onto $A_1$ in a two-to-one manner. The inverse image of the trajectories joining $C_0$ to $C_1$ provides a family of trajectories joining $C_1$ to $C_2$. Let $\psi_2(\theta)$ be the point on $C_2$ at the end of the trajectory leaving $C_1$ at $\psi_1(\theta)$. We continue in this way to get a sequence of loops $C_k$, each surrounding its predecessor, and families of trajectories joining the points $\psi_k(\theta)$ on $C_k$ to $\psi_{k+1}(\theta)$ on $C_{k+1}$ for each $k$.

As $k \to \infty$, the curves $C_k$ approach the boundary of the basin of attraction of $w$; by Lemma 14.11 this boundary is just the Julia set $J(f_c)$. Since $|f_c'(z)| > \gamma$ for some $\gamma > 1$ outside $C_1$, it follows that $f_c^{-1}$ is contracting near $J$. Thus the length of the trajectory joining $\psi_k(\theta)$ to $\psi_{k+1}(\theta)$ converges to 0 at a geometric rate as $k \to \infty$. Consequently $\psi_k(\theta)$ converges uniformly to a continuous function $\psi(\theta)$ as $k \to \infty$, and $J$ is the closed curve given by $\psi(\theta)$ $(0 \leq \theta \leq 2\pi)$.

It remains to show that $\psi$ represents a simple curve. Suppose that $\psi(\theta_1) = \psi(\theta_2)$. Let $D$ be the region bounded by $C_0$ and the two trajectories joining $\psi_0(\theta_1)$ and $\psi_0(\theta_2)$ to this common point. The boundary of $D$ remains bounded under iterates of $f_c$, so by the maximum modulus theorem (that the modulus of an analytic function takes its maximum on the boundary of a region) $D$ remains bounded under iteration of $f_c$. Thus $D$ is a subset of the filled-in Julia set, so the interior of $D$ cannot contain any points of $J$. Thus the situation of figure 14.5(b) cannot occur, so that $\psi(\theta) = \psi(\theta_1) = \psi(\theta_2)$ for all $\theta$ between $\theta_1$ and $\theta_2$. It follows that $\psi(\theta)$ has no point of self-intersection. □

Figure 14.5 Proof of Theorem 14.16

By an extension of this argument, if $c$ is in the main cardioid of $M$, then $J(f_c)$ is a simple closed curve; such curves are sometimes referred to as quasi-circles. Of course, $J(f_c)$ will be a fractal curve if $c \neq 0$. It may be shown that, for small $c$, its dimension is given by

$$s = \dim_B J(f_c) = \dim_H J(f_c) = 1 + \frac{|c|^2}{4 \log 2} + \text{terms in } |c|^3 \text{ and higher powers}. \tag{14.9}$$

Moreover, $0 < \mathcal{H}^s(J) < \infty$, with $\dim_B J(f_c) = \dim_H J(f_c)$ given by a real analytic function of $c$.

The next case to consider is when $f_c$ has an attractive periodic orbit of period 2. By a straightforward calculation this occurs if $|c + 1| < \tfrac{1}{4}$; that is, if $c$ lies in the prominent circular disc of $M$ abutting the cardioid, see Exercise 14.16.
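
One way to carry out this calculation is sketched below as a worked derivation, offered as an illustration rather than as the intended solution to Exercise 14.16. Here $w_1$ and $w_2$ denote the two period-2 points, which are the roots of the quadratic factor left after the fixed points are removed from $f_c^2(z) - z$.

```latex
\begin{align*}
f_c^2(z) - z &= (z^2 + c)^2 + c - z = (z^2 - z + c)\,(z^2 + z + c + 1), \\
w_1 + w_2 &= -1, \qquad w_1 w_2 = c + 1, \\
(f_c^2)'(w_1) &= f_c'(w_2)\, f_c'(w_1) = 4 w_1 w_2 = 4(c + 1), \\
|(f_c^2)'(w_1)| < 1 &\iff |c + 1| < \tfrac{1}{4}.
\end{align*}
```

The first factor $z^2 - z + c$ vanishes at the fixed points of $f_c$, so the period-2 orbit is attractive exactly on the disc of radius $\tfrac{1}{4}$ centred at $-1$.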

Since $f_c^2$ is a polynomial of degree 4, $f_c$ has two fixed points and two period-2 points. Let $w_1$ and $w_2$ be the points of the attractive period-2 orbit. It may be shown, as in the proof of Theorem 14.16, that the basin of attraction for $w_i$ (i.e. $\{z : f_c^{2k}(z) \to w_i \text{ as } k \to \infty\}$) includes a region bounded by a simple closed curve $C_i$ surrounding $w_i$, for $i = 1, 2$. By Lemma 14.11 and Proposition 14.4, $C_i \subset J(f_c^2) = J(f_c)$. The curves $C_i$ are mapped onto themselves in a two-to-one fashion by $f_c^2$, which implies that there is a fixed point of $f_c^2$ on each $C_i$. The period-2 points are strictly inside the $C_i$, so there is a fixed point of $f_c$ on each $C_i$; since the $C_i$ are mapped onto each other by $f_c$, the only possibility is for $C_1$ and $C_2$ to touch at one of the fixed points of $f_c$. The inverse function $f_c^{-1}$ is two-valued on $C_1$. One of the inverse images is $C_2$ (which encloses $w_2$). However, the other branch of $f_c^{-1}(C_1)$ is a further simple closed curve enclosing the second value of $f_c^{-1}(w_1)$. We may continue to take inverse images in this way to find that $J(f_c)$ is made up of infinitely many simple closed curves which enclose the pre-images of $w_1$ and $w_2$ of all orders and touch each other in pairs at 'pinch points'; see figure 14.7(c). Thus we get fractal Julia sets that are topologically much more complicated than in the previous cases.

It is possible to use such ideas to analyse the case when $f_c$ has an attractive periodic orbit of period $p > 2$. The immediate neighbourhoods of the period-$p$ points that are drawn into the attractive orbits are bounded by simple closed curves which touch each other at a common point. The Julia set consists of these fractal curves together with all their pre-images under $f_c^k$.

A variety of examples are shown in figures 14.6 and 14.7. The 'buds' on the 
Mandelbrot set corresponding to attractive orbits of period p are indicated in 
figure 14.8. 

The Julia sets $J(f_c)$ that are most intricate, and are mathematically hardest to analyse, are at the 'exceptional' values of $c$ on the boundary of $M$. If $c$ is on the boundary of the cardioid or a bud of $M$, then $f_c$ has an indifferent periodic point (where $|(f_c^p)'(w)| = 1$). If $c$ is at a 'neck' where a bud touches a parent region, then $J(f_c)$ includes a series of 'tendrils' joining its boundary to the indifferent periodic points. For $c$ elsewhere on the boundary of the cardioid the Julia set may contain 'Siegel discs'. The Julia set $J(f_c)$ consists of infinitely many curves bounding open regions, with $f_c$ mapping each region into a 'larger' one, until the region containing the fixed point is reached. Inside this Siegel disc, $f_c$ rotates points on invariant loops around the fixed point.

There are still further possibilities. If $c$ is on one of the 'hairs' of $M$ then $J(f_c)$ may be a dendrite, i.e. of tree-like form. This occurs if an iterate of the critical point is periodic, i.e. if $f_c^k(0) = f_c^{k+q}(0)$ for positive integers $k$ and $q$.

We have mentioned that there are miniature copies of $M$ located in the hairs of $M$. If $c$ belongs to one of these, then $J(f_c)$ will be of dendrite form, but with small copies of the Julia set from the corresponding value of $c$ in the main part of $M$ inserted at the 'vertices' of the dendrite.

Figure 14.6 Julia sets $J(f_c)$ for $c$ at various points in the Mandelbrot set. The Julia sets are displayed in more detail in figure 14.7

A good way to explore the structure of Julia sets and, indeed, the Julia sets 
of other functions, is using a computer. There are two usual methods of drawing 
Julia sets, based on properties that we have discussed. 

For the first method, we choose a repelling periodic point z. For suitable k, we
may compute the set of inverse images $J_k = f^{-k}(z)$. By Corollary 14.8(b) these
$2^k$ points are in J, and should fill J as k becomes large. A difficulty with picturing
J in this way is that the points of $J_k$ need not be uniformly distributed across
J; they may tend to cluster in some parts of J and be sparse in other parts.
Consequently, even with k quite large, parts of J can be missed altogether. (This
tends to happen for $f_c$ with c close to the boundary of M.) There are various
ways of getting around this difficulty. For instance, with $J_0 = \{z\}$, instead of
taking $J_k = f^{-1}(J_{k-1})$ for each k, we can choose a subset $J_k$ of $f^{-1}(J_{k-1})$ by
ignoring all but one of the points in every 'small' cluster. This ensures that we
are working with a reasonably well distributed set of points of J at each step of
the iteration, and also reduces the calculation involved.
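
The inverse-image method can be sketched in a few lines for the quadratic family
$f_c(z) = z^2 + c$. What follows is only an illustrative sketch, not a definitive
implementation: the function names, the choice of starting fixed point and the
grid-cell thinning rule (keep one point per cell of side `cell`) are all assumptions
made here, and the starting point is repelling only when $c \neq 1/4$.

```python
import cmath


def repelling_fixed_point(c):
    """Return the fixed point w of f_c(z) = z^2 + c with the larger |f_c'(w)| = |2w|.

    The two fixed points sum to 1, so the larger one has |2w| >= 1; it is
    repelling for c != 1/4 (assumption: the parabolic case is excluded).
    """
    root = cmath.sqrt(1 - 4 * c)
    return max([(1 + root) / 2, (1 - root) / 2], key=abs)


def inverse_iteration(c, steps=12, cell=None):
    """Approximate J(f_c) by repeated pre-images of a repelling fixed point.

    If `cell` is given, one point is kept per grid cell of that side length,
    a crude stand-in for 'ignoring all but one point in every small cluster'.
    """
    points = [repelling_fixed_point(c)]
    for _ in range(steps):
        preimages = []
        for z in points:
            r = cmath.sqrt(z - c)          # the two solutions w of w^2 + c = z
            preimages.extend((r, -r))
        if cell is not None:
            kept = {}
            for w in preimages:
                key = (round(w.real / cell), round(w.imag / cell))
                kept.setdefault(key, w)    # first point seen in each cell wins
            preimages = list(kept.values())
        points = preimages
    return points
```

For c = 0 the Julia set is the unit circle, which gives a simple check: every point
returned by `inverse_iteration(0, steps=10)` should have modulus 1 up to rounding.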

Figure 14.7 A selection of Julia sets of the quadratic function $f_c(z) = z^2 + c$.
(a) $c = -0.1 + 0.1i$; $f_c$ has an attractive fixed point, and J is a quasi-circle.
(b) $c = -0.5 + 0.5i$; $f_c$ has an attractive fixed point, and J is a quasi-circle.
(c) $c = -1 + 0.05i$; $f_c$ has an attractive period-2 orbit. (d) $c = -0.2 + 0.75i$; $f_c$ has
an attractive period-3 orbit. (e) $c = 0.25 + 0.52i$; $f_c$ has an attractive period-4 orbit.
(f) $c = -0.5 + 0.55i$; $f_c$ has an attractive period-5 orbit. (g) $c = 0.66i$; $f_c$ has no
attractive orbits and J is totally disconnected. (h) $c = -i$; $f_c^2(0)$ is periodic and J is
a dendrite

Figure 14.8 The periods of the attractive orbits of $f_c$ for c in various parts of the
Mandelbrot set M. If c is in the main cardioid, $f_c$ has an attractive fixed point, and the
Julia set $J(f_c)$ is a quasi-circle. For c in the buds of M, $f_c$ has an attractive orbit with
the period p shown, with p regions inside the Julia set $J(f_c)$ meeting at each pinch point.
Outside M, the function $f_c$ has no attractive orbits and $J(f_c)$ is totally disconnected

A second method is to test individual points to see if they are close to the
Julia set. Suppose, for example, that f has two or more attractive fixed points
(now including $\infty$ if f is a polynomial). If z is a point of J(f) then there
are points arbitrarily close to z in the attractive basin of each attractive point
by Lemma 14.11. To find J we divide a region of $\mathbb{C}$ into a fine mesh. We
examine the ultimate destination under iteration by f of the four corners of each
mesh square. If two of the corners are attracted to different points, we deem the
mesh square to contain a point of J. Often, the other squares, the 'Fatou set',
are coloured according to which point the vertices of the square are attracted to,
perhaps with different shading according to how close the kth iterates are to the
attractive point for some fixed k.
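
As a rough illustration of the mesh method, the sketch below treats $f_c(z) = z^2 + c$
in the simplest situation, where the only destinations considered are $\infty$ and
'remains bounded'. The function names, the iteration cap and the escape radius 2
(adequate when $|c| \leq 2$) are assumptions of this sketch rather than part of the
method as described.

```python
def destination(z, c, max_iter=60, escape_radius=2.0):
    """Label the fate of z under f_c(z) = z^2 + c as 'infinity' or 'bounded'."""
    for _ in range(max_iter):
        if abs(z) > escape_radius:
            return "infinity"
        z = z * z + c
    return "bounded"


def boundary_squares(c, half_width=1.5, n=60):
    """Return centres of mesh squares whose four corners do not share a destination."""
    h = 2 * half_width / n
    # Destinations at the (n + 1) x (n + 1) mesh vertices.
    fate = [[destination(complex(-half_width + i * h, -half_width + j * h), c)
             for j in range(n + 1)] for i in range(n + 1)]
    centres = []
    for i in range(n):
        for j in range(n):
            corners = {fate[i][j], fate[i + 1][j],
                       fate[i][j + 1], fate[i + 1][j + 1]}
            if len(corners) > 1:           # corners disagree: square meets J
                centres.append(complex(-half_width + (i + 0.5) * h,
                                       -half_width + (j + 0.5) * h))
    return centres
```

With c = 0 the marked squares should hug the unit circle; for other c in the main
cardioid they trace out the quasi-circle. Distinguishing several finite attractors
would need a destination test that compares late iterates with each attractive point.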

Both of these methods can be awkward to use in certain cases. A knowledge 
of the mathematical theory is likely to be beneficial in overcoming the difficulties 
that can occur. 



14.4 Characterization of quasi-circles by dimension 

We saw in the previous section that, if c is in the main cardioid of the Mandelbrot
set, then the Julia set of $f_c(z) = z^2 + c$ is a simple closed curve. By similar
arguments, the Julia set of $f(z) = z^n + c$ is a simple closed curve for any integer
$n \geq 2$ provided that c is small enough, and, indeed, the same is true for
$f(z) = z^2 + g(z)$ for a wide variety of analytic functions g that are 'sufficiently small'
near the origin. Thus all these functions have Julia sets that are topologically the
same: they are all homeomorphic to a circle. The surprising thing is that they
are essentially the same as fractals, in other words are bi-Lipschitz equivalent,
if and only if they have the same Hausdorff dimension. Of course, if two sets
have different dimensions they cannot be bi-Lipschitz equivalent (Corollary 2.4).
However, in this particular situation the converse is also true.

We term a set F a quasi-self-similar circle or quasi-circle if the following
conditions are satisfied:

(i) F is homeomorphic to a circle (i.e. F is a simple closed curve),
(ii) $0 < \mathcal{H}^s(F) < \infty$ where $s = \dim_H F$,
(iii) there are constants a, b, r > 0 such that for any subset U of F with
$|U| \leq r$ there is a mapping $\varphi : U \to F$ such that

$$a|x - y| \leq |U|\,|\varphi(x) - \varphi(y)| \leq b|x - y| \qquad (x, y \in U). \qquad (14.10)$$

The 'quasi-self-similar' condition (iii) says that arbitrarily small parts of F are
'roughly similar' to a large part of F.

The following theorem depends on using s-dimensional Hausdorff measure to
measure 'distance' round a quasi-circle.

Theorem 14.17 

Quasi-circles E and F are bi-Lipschitz equivalent if and only if $\dim_H E = \dim_H F$.

Sketch of proof. If there is a bi-Lipschitz mapping between E and F then
$\dim_H E = \dim_H F$ by Corollary 2.4(b).

Suppose that $\dim_H E = \dim_H F$. Let E(x, y) be the 'arc' of E between points
$x, y \in E$, taken in the clockwise sense, with a similar notation for arcs of F.
Conditions (ii) and (iii) imply that $\mathcal{H}^s(E(x, y))$ is continuous in $x, y \in E$ and is
positive if $x \neq y$. We claim that there are constants $c_1, c_2 > 0$ such that

$$c_1 \leq \frac{\mathcal{H}^s(E(x, y))}{|x - y|^s} \leq c_2 \qquad (14.11)$$

whenever E(x, y) is the 'shorter' arc, i.e. $\mathcal{H}^s(E(x, y)) \leq \mathcal{H}^s(E(y, x))$. Assume
that $\varepsilon > 0$ is sufficiently small. If $|x - y| \geq \varepsilon$ then (14.11) is true by a continuity
argument for suitable constants. If $|x - y| < \varepsilon$ there is a mapping
$\varphi : E(x, y) \to E$ satisfying (14.10) such that $|\varphi(x) - \varphi(y)| \geq \varepsilon$. Inequalities (14.10), (2.8) and
(2.9) imply that the ratio (14.11) changes by a bounded amount if x and y are
replaced by $\varphi(x)$ and $\varphi(y)$, so (14.11) holds for suitable $c_1$ and $c_2$ for all $x, y \in E$.

Now choose base points $p \in E$, $q \in F$. Set $\gamma = \mathcal{H}^s(E)/\mathcal{H}^s(F)$ and define
$\psi : E \to F$ by taking $\psi(x)$ to be the point of F such that

$$\mathcal{H}^s(E(p, x)) = \gamma\,\mathcal{H}^s(F(q, \psi(x)))$$

(see figure 14.9). Then $\psi$ is a continuous bijection, and also

$$\mathcal{H}^s(E(x, y)) = \gamma\,\mathcal{H}^s(F(\psi(x), \psi(y))) \qquad (x, y \in E).$$

Using (14.11) together with similar inequalities for arcs of F, this gives

$$c_3 \leq \frac{|\psi(x) - \psi(y)|}{|x - y|} \leq c_4$$

if $x \neq y$, so that $\psi$ is bi-Lipschitz, as required. □

Figure 14.9 Setting up a bi-Lipschitz mapping $\psi$ between two quasi-circles of Hausdorff
dimension s



Corollary 14.18 

Suppose that the Julia sets $J_1$ and $J_2$ of the polynomials $f_1$ and $f_2$ are simple
closed curves. Suppose that $f_i$ is strictly repelling on $J_i$ (i.e. $|f_i'(z)| > 1$ for
$z \in J_i$) for i = 1, 2. Then $J_1$ and $J_2$ are bi-Lipschitz equivalent if and only if
$\dim_H J_1 = \dim_H J_2$.

Proof. It may be shown that if a polynomial f is strictly repelling on its Julia set
J then $0 < \mathcal{H}^s(J) < \infty$, where $s = \dim_H J$. Moreover, given a subset U of J, we
may choose k so that $f^k(U)$ has diameter comparable with that of J itself, and
(14.10) holds taking $\varphi = f^k$ (this reflects the quasi-self-similarity of J). Thus $J_1$
and $J_2$ are quasi-circles to which Theorem 14.17 may be applied. □



14.5 Newton's method for solving polynomial equations 

Anyone who has done any numerical analysis will have found roots of equations
using Newton's method. Let p(x) be a function with continuous derivative.
If $f(x) = x - p(x)/p'(x)$ then the iterates $f^k(x)$ converge to a solution of
$p(x) = 0$ provided that $p'(x) \neq 0$ at the solution and that the initial value of x is
chosen appropriately. Cayley suggested investigating the method in the complex
plane, and in particular which initial points of $\mathbb{C}$ iterate to which zero of p.

Let $p : \mathbb{C} \to \mathbb{C}$ be a polynomial with complex coefficients, and form the
rational function $f : \mathbb{C} \cup \{\infty\} \to \mathbb{C} \cup \{\infty\}$

$$f(z) = z - p(z)/p'(z). \qquad (14.12)$$

Then the fixed points of f, given by $p(z)/p'(z) = 0$, are the zeros of p together
with $\infty$. Differentiating, we find that

$$f'(z) = p(z)p''(z)/p'(z)^2 \qquad (14.13)$$

so a zero z of p is a superattractive fixed point of f, i.e. $f'(z) = 0$, provided
that $p'(z) \neq 0$. If |z| is large, $f(z) \sim z(1 - 1/n)$, where n is the degree of p, so
$\infty$ is a repelling point of f. As usual, we write

$$A(w) = \{z : f^k(z) \to w\} \qquad (14.14)$$

for the basin of attraction of the zero w, i.e. the set of initial points which
converge to w under Newton iteration. Since the zeros are attractive, the basin
A(w) includes an open region containing w. We shall see, however, that A(w)
can be remarkably complicated further away from w.
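
The iteration (14.12) is easily tried numerically. The following is a minimal
sketch: the function name `newton_limit`, the stopping tolerance and the iteration
cap are choices made for illustration, and the polynomial and its derivative are
supplied by the caller as ordinary Python functions.

```python
def newton_limit(p, dp, z, max_iter=100, tol=1e-12):
    """Iterate f(z) = z - p(z)/p'(z) from z, as in (14.12).

    Returns the approximate limit, or None if p'(z) vanishes along the way
    (so that f(z) is the point at infinity) or no limit is found in max_iter steps.
    """
    for _ in range(max_iter):
        d = dp(z)
        if d == 0:
            return None
        z_next = z - p(z) / d
        if abs(z_next - z) < tol:
            return z_next
        z = z_next
    return None
```

For example, with `p = lambda z: z**3 - 1` and `dp = lambda z: 3 * z**2`, the call
`newton_limit(p, dp, 2 + 0j)` converges to the zero 1, so the starting point 2 lies
in A(1). Deciding membership of A(w) in general amounts to comparing the returned
limit with each zero w.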

The theory of Julia sets developed for polynomials in Section 14.1 is almost
the same for a rational function, provided that the point $\infty$ is included in the
natural way. The main differences are that if f is a rational function J(f) need
not be bounded (though it must be closed) and it is possible for J(f) to have
interior points, in which case $J(f) = \mathbb{C} \cup \{\infty\}$. However, Lemma 14.11 remains
true, so that J(f) is the boundary of A(w) for each attractive fixed point w. Thus
J(f) is likely to be important when analysing the domains of attraction of the
roots in Newton's method.

A straightforward case is the quadratic polynomial

$$p(z) = z^2 - c \qquad (c \neq 0)$$

with zeros $\pm\sqrt{c}$ (as before, more general quadratic polynomials can be reduced
to this form by a conjugacy). Newton's formula (14.12) becomes

$$f(z) = (z^2 + c)/2z.$$

Thus

$$f(z) \pm \sqrt{c} = (z \pm \sqrt{c})^2/2z$$

so

$$\frac{f(z) + \sqrt{c}}{f(z) - \sqrt{c}} = \left(\frac{z + \sqrt{c}}{z - \sqrt{c}}\right)^2. \qquad (14.15)$$

It follows immediately that if $|z + \sqrt{c}|/|z - \sqrt{c}| < 1$ then
$|f^k(z) + \sqrt{c}|/|f^k(z) - \sqrt{c}| \to 0$ and so $f^k(z) \to -\sqrt{c}$ as $k \to \infty$, and similarly if
$|z + \sqrt{c}|/|z - \sqrt{c}| > 1$ then $f^k(z) \to \sqrt{c}$. The Julia set J(f) is the line
$|z + \sqrt{c}| = |z - \sqrt{c}|$ (the perpendicular bisector of $-\sqrt{c}$ and $\sqrt{c}$) and $A(-\sqrt{c})$ and
$A(\sqrt{c})$ are the half-planes on either side. (Letting $h(z) = (z + \sqrt{c})/(z - \sqrt{c})$
in (14.15) gives $f(z) = h^{-1}((h(z))^2)$, so that f is conjugate to, and therefore has
similar dynamics to, the mapping $g(z) = z^2$.) In this case the situation is very
regular: any initial point is iterated by f to the nearest zero of p.
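
The 'nearest zero' behaviour of the quadratic case can be checked numerically. The
sketch below is illustrative only; the helper names and the fixed number of steps
are assumptions, and starting points on or extremely close to the perpendicular
bisector (the Julia set) are deliberately avoided.

```python
import cmath


def newton_quadratic(z, c, steps=60):
    """Iterate f(z) = (z^2 + c) / (2z), the Newton function of p(z) = z^2 - c."""
    for _ in range(steps):
        z = (z * z + c) / (2 * z)
    return z


def nearest_root(z, c):
    """Return whichever of the two square roots of c is closer to z."""
    r = cmath.sqrt(c)
    return r if abs(z - r) < abs(z + r) else -r
```

For instance, with c = 1 + 2i the values `newton_quadratic(z0, c)` and
`nearest_root(z0, c)` agree to high accuracy for starting points such as
z0 = 2 + i or z0 = -1 - 3i, in line with (14.15).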

The quadratic example might lead us to hope that the domains of attraction 
under Newton iteration of the zeros of any polynomial are reasonably regular. 
However, for higher-order polynomials the situation is fundamentally different.
Lemma 14.11 (which holds for rational functions, including Newton functions
of polynomials) provides a hint that something very strange happens. If p has
zeros $z_1, \ldots, z_n$ with $p'(z_i) \neq 0$, Lemma 14.11 tells us that the Julia set of f is
the boundary of the domain of attraction of every zero:

$$J(f) = \partial A(z_1) = \cdots = \partial A(z_n).$$

A point on the boundary of any one of the domains of attraction must be on the 
boundary of all of them; since J(f) is uncountable, there are a great many such 
multiple boundary points. An attempt to visualize three or more disjoint sets with 
this property will convince the reader that they must be very complicated indeed. 
Let us look at a specific example. The cubic polynomial

$$p(z) = z^3 - 1$$

has zeros $1, e^{i2\pi/3}, e^{i4\pi/3}$, and Newton function

$$f(z) = \frac{2z^3 + 1}{3z^2}.$$

The transformation $\rho(z) = z e^{i2\pi/3}$ is a rotation of 120° about the origin. It is
easily checked that $f(\rho(z)) = \rho(f(z))$, in other words $\rho$ is a conjugacy of f
to itself. It follows that a rotation of 120° about the origin maps A(w) onto
$A(w e^{i2\pi/3})$ for each of the three zeros w, so that the Julia set has threefold
symmetry about the origin. (Of course, these symmetries would be expected from
the symmetric disposition of the three zeros of p.) If z is real then $f^k(z)$ remains
real for all k, and, by elementary arguments, $f^k(z)$ converges to 1 except for
countably many real z. Thus A(1) contains the real axis except for a countable
number of points, and, by symmetry, $A(e^{i2\pi/3})$ and $A(e^{i4\pi/3})$ contain the lines
through the origin making 120° and 240° to the real axis, again except for
countably many points. We also know that each A(w) contains an open region round
w, that any point on the boundary of one of the A(w) is on the boundary of
all three, and that there are uncountably many such 'triple points'. Most
people require the insight of a computer drawing to resolve this almost paradoxical
situation, see figure 14.10.
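
A picture of this kind can be produced by classifying each starting point according
to the zero that its Newton iterates approach. The following sketch indicates one
way of doing so; the function name, tolerance and iteration cap are assumptions, and
points that fail to settle within the cap (for instance points of J(f) itself) are
reported as `None`.

```python
import cmath

# The three zeros of p(z) = z^3 - 1.
ROOTS = [cmath.exp(2j * cmath.pi * k / 3) for k in range(3)]


def basin_index(z, max_iter=200, tol=1e-9):
    """Return k if Newton iteration for z^3 - 1 from z reaches ROOTS[k], else None."""
    for _ in range(max_iter):
        if z == 0:
            return None                    # f(0) is the point at infinity
        z = (2 * z**3 + 1) / (3 * z**2)
        for k, w in enumerate(ROOTS):
            if abs(z - w) < tol:
                return k
    return None
```

Evaluating `basin_index` over a fine grid and colouring by the returned index gives
a drawing of the three basins. The threefold symmetry can be seen directly: if a
point z has index 0, then z multiplied by `ROOTS[1]` has index 1.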

The domain A(1) is shown in black in figure 14.10(b); note that the basins
of attraction of the other two zeros, obtained by rotation of 120° and 240°, key
into the picture in a natural way. The Julia set shown in figure 14.10(a) is the
boundary of the three basins and is made up of three 'chains' leading away from
the origin. These fractal chains have a fine structure: arbitrarily close to each
point of J(f) is a 'slightly distorted' copy of the picture at the origin with six
components of the A(w) meeting at a point. This reflects Corollary 14.8(b), again
valid for rational functions: J(f) is the closure of $\bigcup_{k=1}^{\infty} f^{-k}(0)$, so that if z is
in J(f) then there is a point w arbitrarily close to z, and an integer k, such
that $f^k(w) = 0$. But $f^k$ is locally a conformal mapping, so that the local inverse
$(f^k)^{-1}$ maps a neighbourhood of 0 to an 'almost similar' neighbourhood of w.
The Julia set J(f) exhibits quasi-self-similarity.

Figure 14.10 Newton's method for $p(z) = z^3 - 1$. The Julia set for the Newton function
$f(z) = (2z^3 + 1)/3z^2$ is shown in (a), and the domain of attraction of the zero z = 1 is
shown in black in (b)

This, of course, is just the beginning. The domains of attraction of the zeros 
of other polynomials of degree 3 or more and, indeed, other analytic functions, 
may be investigated using a combination of theory and computer graphics. This 
leads to a wealth of sets of a highly intricate form which raise many intriguing 
mathematical questions. 

In this chapter we have touched on what is a complicated and fascinating area 
of mathematics in which fractals play a fundamental role. It is an area in which 
computer experiments often lead the way with mathematical theory trying to keep 
up. The variations are endless — we can investigate the Julia set of higher-order 
polynomials and of other analytic functions such as exp z, as well as invariant 
sets of non-analytic transformations of the plane. With the advent of high quality 
colour computer graphics, these ideas have become the basis of much computer 
art. A single function of simple form can lead to highly intricate yet regular 
pictures — often very beautiful, sometimes almost weird. 



14.6 Notes and references 

Much of the basic theory of iteration of complex functions was developed by Julia 
(1918) and Fatou (1919). For many years the subject lay almost dormant, until 
computer graphics was sufficiently advanced to reveal the intricate form of Julia 
sets. Recently, there has been an extraordinary interest in the subject, and drawing 
Julia sets and the Mandelbrot set on computers has become a popular activity, 
with the incredibly beautiful and intricate pictures that are easily produced. 

For an account of basic complex variable theory, see Ahlfors (1979). The 
books by Peitgen and Richter (1986), Becker and Dörfler (1989) and Peitgen,
Jürgens and Saupe (1992) provide well-illustrated accounts of complex iteration.
Other recommended accounts of complex dynamics include Blanchard (1984), 
Saupe (1987), Devaney (1989), Beardon (1991), Carleson and Gamelin (1993), 
McMullen (1994) and Milnor (1999). Mandelbrot (1980) introduced the set bear- 
ing his name; the fundamental Theorem 14.14 is given by Brolin (1965). A col- 
lection of papers on topics related to the Mandelbrot set was edited by Tan (2000). 

The formula (14.9) for the dimension of J(f c ) is due to Ruelle (1982), and 
extensions are given by Collet, Dobbertin and Moussa (1992) and Abenda,
Moussa and Osbaldestin (1999). For details of the characterization of quasi-circles
by dimension, see Falconer and Marsh (1989). Fractals associated with
Newton's method are discussed in Peitgen and Richter (1986), Peitgen, Saupe 
and von Haeseler (1984) and Curry, Garnett and Sullivan (1983). 



Exercises 

Throughout these exercises we write $f_c(z) = z^2 + c$.

14.1 Show that every perfect set E (i.e. every set that is closed and has no isolated
points) is uncountable. (Hint: Construct a 'Cantor-like' set inside E.) Deduce that
$J(f_c)$ is uncountable for all c.

14.2 Describe the Julia set of $f(z) = z^2 + 4z + 2$. (Hint: Write $z_1 = z + 2$.)

14.3 Let $f(z) = z^2 + 2iz + b$. By considering the mapping $h(z) = z + i$, show that the
Julia set of f is connected if and only if $b + 1 + i \in M$.

14.4 Let $|c| < \frac{1}{4}$. Show that if $|z| \leq \frac{1}{2}$ then $|f_c(z)| \leq \frac{1}{2}$. Deduce that
$B(0, \frac{1}{2}) \subset K(f_c)$. Also show that if $|z| \geq 2$ then $|f_c(z)| \geq \frac{3}{2}|z|$. Deduce that
$K(f_c) \subset B(0, 2)$. What does this tell us about the location of $J(f_c)$?

14.5 Let $f(z) = z^2 - 2$. Find a repelling fixed point of f. Deduce from Corollary 14.8
that J(f) is a subset of the real interval [-2, 2]. Use Theorem 14.14 to deduce
that J(f) is connected, and hence that it is the interval [-2, 2].

14.6 Show that the Julia set $J(f_c)$ is symmetric about the origin (that is $z \in J(f_c)$ if
and only if $-z \in J(f_c)$).

14.7 Show that if c is real with $c > \frac{1}{4}$ then $z \notin J(f_c)$ for all real numbers z. Deduce,
using Exercise 14.6, that if $c > \frac{1}{4}$ then $J(f_c)$ is not connected, and so $c \notin M$.

14.8 Show that if c is a non-real number with $|c| < \frac{1}{4}$ and $w = \frac{1}{2}(1 + (1 - 4c)^{1/2})$ is
the repelling fixed point of $f_c(z) = z^2 + c$ then $f_c'(w)$ is not real. Deduce that the
simple closed curve that forms the Julia set $J(f_c)$ cannot have a tangent at w.
Hence deduce that the curve contains no differentiable arcs.

14.9 Show that if $|c| < \frac{1}{4}$ and $|z| < \frac{1}{2}$ then $|f_c(z)| < \frac{1}{2}$. Deduce that $B(0, \frac{1}{4}) \subset M$.

14.10 Show that if $|c + 1| < \frac{1}{20}$ and $|z| < \frac{1}{10}$ then $|f_c(f_c(z))| < \frac{1}{10}$. Deduce that
$B(-1, \frac{1}{20}) \subset M$.

14.11 Show that if $\varepsilon > 0$ and if $|z| > \max(2 + \varepsilon, |c|)$ then $|f_c(z)| > |z|(1 + \varepsilon)$. Deduce
that if $|c| > 2$ then $c \notin M$.

14.12 Use Exercise 14.11 to show that $\mathbb{C} \setminus M = \{c : |f_c^k(0)| > 2 \text{ for some } k\}$.

14.13 Show that if \c\ < 1 then the Julia set of f(z) = z 3 + 3z is a simple closed curve. 

14.14 Obtain an estimate for the dimension of the Julia set of f(z) = z 3 + c when \c\ 
is large. 

14.15 Show that $f_c$ has an attractive fixed point precisely when c lies inside the main
cardioid of the Mandelbrot set given by $z = \frac{1}{2}e^{i\theta}(1 - \frac{1}{2}e^{i\theta})$ for $0 \leq \theta \leq 2\pi$.

14.16 Show that $f_c^2(z) - z = (z^2 - z + c)(z^2 + z + c + 1)$. Deduce that $f_c$ has an
attractive period-2 point just when $|c + 1| < \frac{1}{4}$ (the large disc in M).

14.17 Show that if w is an attractive fixed point of $f_c$ then the attractive basin A(w) must
contain the point c. (Hint: Show that otherwise there is an open neighbourhood
of w on which the inverse iterates $f_c^{-k}$ of $f_c$ can be uniquely defined and form
a normal family, which is impossible since w is a repelling fixed point of $f_c^{-1}$.)
Deduce that $f_c$ can have at most one attractive fixed point. Generalize this to show
that if w is an attractive fixed point of any polynomial f then A(w) contains a
point f(z) for some z with $f'(z) = 0$.

14.18 Let f be a quadratic polynomial. Show by applying Exercise 14.17 to $f^p$ for
positive integers p, that f can have at most one attractive periodic orbit.

14.19 Write a computer program to draw Julia sets of functions (see the end of 
Section 14.3). Try it out first on quadratic functions, then experiment with other 
polynomials and rational functions, and then other functions such as exp z. 

14.20 Use a computer to investigate the domains of attraction for the zeros of some other
polynomials under Newton's method iteration; for example for $p(z) = z^4 - 1$ or
$p(z) = z^3 - z$.



Chapter 15 Random fractals 



Many of the fractal constructions that have been encountered in this book have 
random analogues. For example, in the von Koch curve construction, each time 
we replace the middle third of an interval by the other two sides of an equilateral 
triangle, we might toss a coin to determine whether to position the new part 
'above' or 'below' the removed segment. After a few steps, we get a rather 
irregular looking curve which nevertheless retains certain of the characteristics 
of the von Koch curve; see figure 15.1. 

The middle third Cantor set construction may be randomized in several ways,
as in figure 15.2. Each time we divide a segment into three parts we could, instead
of always removing the middle segment, throw a die to decide which parts to
remove. Alternatively, we might choose the interval lengths at each stage of the
construction at random, so that at the kth stage we are left with $2^k$ intervals of
differing lengths, resulting in a rather irregular looking fractal.

Whilst such 'random fractals' do not have the self-similarity of their non- 
random counterparts, their non-uniform appearance is often rather closer to nat- 
ural phenomena such as coastlines, topographical surfaces or cloud boundaries. 
Indeed, random fractal constructions are the basis of many impressive computer- 
drawn landscapes or skyscapes. 

Most fractals discussed in this book involve a sequence of approximations
$E_k$, each obtained from its predecessor by modification in increasingly fine
detail, with a fractal F as a limiting set. A random fractal worthy of the
name should display randomness at all scales, so it is appropriate to introduce
a random element at each stage of the construction. By relating the size of
the random variations to the scale, we can arrange for the fractal to be
statistically self-similar in the sense that enlargements of small parts have the same
statistical distribution as the whole set. This compares with (non-random)
self-similar sets (see Chapter 9) where enlargements of small parts are identical to
the whole.

In order to describe fractal constructions involving infinitely many random
steps with precision, we must use the language of probability theory, a brief
survey of which is given in Section 1.4.

Figure 15.1 Construction of a 'random von Koch curve'. At each step a coin is tossed
to determine on which side of the removed segment to place the new pair of segments

Figure 15.2 Two random versions of the Cantor set. In (a) each interval is divided into
three equal parts from which some are selected at random. In (b) each interval is replaced
by two subintervals of random lengths



15.1 A random Cantor set

We give a detailed analysis of a specific statistically self-similar construction. 
It resembles that of the middle third Cantor set, except that the lengths of the 
intervals at each stage are random. This analysis extends to much more general 
random fractals. 

Intuitively, we consider a construction $F = \bigcap_{k=1}^{\infty} E_k$, where
$[0, 1] = E_0 \supset E_1 \supset \cdots$ is a decreasing sequence of closed sets, with $E_k$ a union of $2^k$
disjoint closed kth level basic intervals. We assume that each kth level interval I contains
two (k + 1)th level intervals $I_L$ and $I_R$, abutting the left- and right-hand ends of I
respectively. The lengths of the intervals are random, and we impose statistical
self-similarity by the requirement that the length ratios $|I_L|/|I|$ have independent
and identical probability distribution for every basic interval I of the construction,
and similarly for the ratios $|I_R|/|I|$. This 'random Cantor set' F is statistically
self-similar, in that the distribution of the set $F \cap I$ is the same as that of F, but
scaled by a factor |I|, for each I.

We describe this random construction in probabilistic terms. Let a, b be
constants with $0 < a \leq b < \frac{1}{2}$. We let $\Omega$ denote the class of all decreasing sequences
of sets $[0, 1] = E_0 \supset E_1 \supset E_2 \supset \cdots$ satisfying the following conditions. The set
$E_k$ comprises $2^k$ disjoint closed intervals $I_{i_1,\ldots,i_k}$ indexed by $i_1, \ldots, i_k$, where
$i_j = 1$ or 2 ($1 \leq j \leq k$); see figure 15.3. The interval $I_{i_1,\ldots,i_k}$ of $E_k$ contains the
two intervals $I_{i_1,\ldots,i_k,1}$ and $I_{i_1,\ldots,i_k,2}$ of $E_{k+1}$, with the left-hand ends of $I_{i_1,\ldots,i_k}$
and $I_{i_1,\ldots,i_k,1}$ and the right-hand ends of $I_{i_1,\ldots,i_k}$ and $I_{i_1,\ldots,i_k,2}$ coinciding. We write
$C_{i_1,\ldots,i_k} = |I_{i_1,\ldots,i_k}|/|I_{i_1,\ldots,i_{k-1}}|$, and suppose that $a \leq C_{i_1,\ldots,i_k} \leq b$ for all $i_1, \ldots, i_k$.
We thus define the random Cantor set $F = \bigcap_{k=1}^{\infty} E_k$.

Figure 15.3 Construction of the random Cantor set analysed in Theorem 15.1. The length
ratios $|I_{i_1,\ldots,i_k,1}|/|I_{i_1,\ldots,i_k}|$ have the same statistical distribution for each $i_1, \ldots, i_k$, and
similarly for $|I_{i_1,\ldots,i_k,2}|/|I_{i_1,\ldots,i_k}|$

We take $\Omega$ to be our sample space, and assume that a probability measure P is
defined on a suitably large family $\mathcal{F}$ of subsets of $\Omega$, such that the ratios $C_{i_1,\ldots,i_k}$
are random variables. We impose statistical self-similarity on the construction by
requiring $C_{i_1,\ldots,i_k,1}$ to have the same distribution as $C_1 = |I_1|$, and $C_{i_1,\ldots,i_k,2}$ to
have the same distribution as $C_2 = |I_2|$ for every sequence $i_1, \ldots, i_k$. We assume
that the $C_{i_1,\ldots,i_k}$ are independent random variables, except that for each sequence
$i_1, \ldots, i_k$ we do not require $C_{i_1,\ldots,i_k,1}$ and $C_{i_1,\ldots,i_k,2}$ to be independent. It may be
shown that $\dim_H F$ is a random variable which may be expressed in terms of the
$C_{i_1,\ldots,i_k}$.
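
To make the construction concrete, the sketch below generates the $2^k$ intervals of
$E_k$ for one realization. It is a minimal illustration under stated assumptions:
the ratios are drawn independently and uniformly from [a, b], which is just one
distribution satisfying the conditions above, and the function name and default
parameters are hypothetical.

```python
import random


def random_cantor_intervals(k, a=0.2, b=0.4, rng=None):
    """Return the 2**k intervals of E_k as (left, right) pairs, from left to right.

    Each length ratio is drawn independently and uniformly from [a, b]
    (an illustrative assumption about the distribution).
    """
    if not 0 < a <= b < 0.5:
        raise ValueError("need 0 < a <= b < 1/2")
    rng = rng or random.Random()
    level = [(0.0, 1.0)]                       # E_0 = [0, 1]
    for _ in range(k):
        next_level = []
        for left, right in level:
            length = right - left
            c_left = rng.uniform(a, b)         # ratio for the left subinterval
            c_right = rng.uniform(a, b)        # ratio for the right subinterval
            next_level.append((left, left + c_left * length))
            next_level.append((right - c_right * length, right))
        level = next_level
    return level
```

Calling `random_cantor_intervals(6, rng=random.Random(0))` returns 64 disjoint
intervals; since b < 1/2, the two subintervals of each interval I are separated by
a gap of at least (1 - 2b)|I|.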

The following result is a random version of Theorem 9.3. 

Theorem 15.1 

With probability 1, the random Cantor set F described above has $\dim_H F = s$,
where s is the solution of the expectation equation

$$\mathsf{E}(C_1^s + C_2^s) = 1. \qquad (15.1)$$

*Proof. It is easy to see that $\mathsf{E}(C_1^s + C_2^s)$ is continuous and strictly decreasing in
s, and thus that (15.1) has a unique solution.
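
Since the left-hand side of (15.1) is strictly decreasing in s, taking the value 2 at
s = 0 and a value less than 1 at s = 1, the solution can be located by bisection.
The sketch below assumes, purely for illustration, that $C_1$ and $C_2$ are each
uniformly distributed on [a, b], in which case
$\mathsf{E}(C^s) = (b^{s+1} - a^{s+1})/((s+1)(b-a))$; the function names are hypothetical.

```python
import math


def solve_expectation_equation(moment_sum, tol=1e-12):
    """Solve moment_sum(s) = 1 for s in [0, 1] by bisection.

    moment_sum(s) should return E(C_1^s + C_2^s), assumed strictly decreasing.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if moment_sum(mid) > 1:
            lo = mid                           # root lies to the right of mid
        else:
            hi = mid
    return (lo + hi) / 2


def uniform_moment_sum(a, b):
    """Return s -> E(C_1^s + C_2^s) for C_1, C_2 each uniform on [a, b]."""
    def moment_sum(s):
        if a == b:                             # deterministic ratios
            return 2 * a**s
        return 2 * (b**(s + 1) - a**(s + 1)) / ((s + 1) * (b - a))
    return moment_sum
```

With a = b = 1/3 the construction is the middle third Cantor set and the routine
recovers s = log 2 / log 3, in agreement with the non-random theory.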

By slight abuse of notation, we write $I \in E_k$ to mean that the interval I is a kth
level interval $I_{i_1,\ldots,i_k}$ of $E_k$. For such an interval I we write $I_L$ and $I_R$ for $I_{i_1,\ldots,i_k,1}$
and $I_{i_1,\ldots,i_k,2}$ respectively. We write $\mathsf{E}(X|\mathcal{F}_k)$ for the conditional expectation of a
random variable X given a knowledge of the $C_{i_1,\ldots,i_j}$ for all sequences $i_1, \ldots, i_j$
with $j \leq k$. (Intuitively, we imagine that $E_0, \ldots, E_k$ have been constructed, and
we are analysing what happens thereafter.) Let $I_{i_1,\ldots,i_k}$ be an interval of $E_k$. Then
for s > 0

$$\mathsf{E}\big((|I_{i_1,\ldots,i_k,1}|^s + |I_{i_1,\ldots,i_k,2}|^s)\,\big|\,\mathcal{F}_k\big) = \mathsf{E}(C_{i_1,\ldots,i_k,1}^s + C_{i_1,\ldots,i_k,2}^s)\,|I_{i_1,\ldots,i_k}|^s$$
$$= \mathsf{E}(C_1^s + C_2^s)\,|I_{i_1,\ldots,i_k}|^s$$

by virtue of the identical distribution. Summing over all the intervals in $E_k$,

$$\mathsf{E}\Big(\sum_{I \in E_{k+1}} |I|^s \,\Big|\, \mathcal{F}_k\Big) = \sum_{I \in E_k} |I|^s\, \mathsf{E}(C_1^s + C_2^s). \qquad (15.2)$$

It follows that the unconditional expectation satisfies

$$\mathsf{E}\Big(\sum_{I \in E_{k+1}} |I|^s\Big) = \mathsf{E}\Big(\sum_{I \in E_k} |I|^s\Big)\, \mathsf{E}(C_1^s + C_2^s). \qquad (15.3)$$

With s the solution of (15.1), equation (15.2) becomes

$$\mathsf{E}\Big(\sum_{I \in E_{k+1}} |I|^s \,\Big|\, \mathcal{F}_k\Big) = \sum_{I \in E_k} |I|^s. \qquad (15.4)$$

Readers familiar with probability theory will recognize (15.4) as saying that the
sequence of random variables

$$X_k = \sum_{I \in E_k} |I|^s \qquad (15.5)$$

is a martingale with respect to $\mathcal{F}_k$. Moreover, a routine calculation, see Exercise
15.7, shows that it is an $L^2$-bounded martingale, that is there is a number c such
that $\mathsf{E}(X_k^2) \leq c$ for all k. The crucial fact for our purposes, which we ask other
readers to take on trust, is that, in this situation, $X_k$ converges with probability
1 as $k \to \infty$ to a random variable X such that $\mathsf{E}(X) = \mathsf{E}(X_0) = \mathsf{E}(1^s) = 1$. In
particular $0 \leq X < \infty$ with probability 1, and X = 0 with probability q < 1.
But X = 0 if and only if $\sum_{I \in E_k,\, I \subset I_1} |I|^s$ and $\sum_{I \in E_k,\, I \subset I_2} |I|^s$ both converge to 0 as
$k \to \infty$, where $I_1$ and $I_2$ are the intervals of $E_1$, and this happens with probability
$q^2$, by virtue of the self-similarity of the construction. Hence $q = q^2$, so q = 0,
and we conclude that $0 < X < \infty$ with probability 1. In particular, this implies
that with probability 1 there are (random) numbers $M_1, M_2$ such that

$$0 < M_1 \leq X_k = \sum_{I \in E_k} |I|^s \leq M_2 < \infty \qquad (15.6)$$

for all k. We have $|I| \leq 2^{-k}$ for all $I \in E_k$, so $\mathcal{H}^s_\delta(F) \leq \sum_{I \in E_k} |I|^s \leq M_2$ if
$k \geq -\log\delta/\log 2$, giving $\mathcal{H}^s(F) \leq M_2$. Thus $\dim_H F \leq s$ with probability 1.

We use the potential theoretic method of Section 4.3 to derive the almost sure
lower bound for $\dim_H F$. To do this we introduce a random mass distribution $\mu$ on
the random set F. Let s satisfy (15.1). For $I \in E_k$ let $\mu(I)$ be the random variable

$$\mu(I) = \lim_{j \to \infty} \sum \{|J|^s : J \in E_j \text{ and } J \subset I\}.$$

As with (15.5) this limit exists, and $0 < \mu(I) < \infty$ with probability 1.
Furthermore, if $I \in E_k$,

$$\mathsf{E}(\mu(I)|\mathcal{F}_k) = |I|^s. \qquad (15.7)$$

If $I \in E_k$ then $\mu(I) = \mu(I_L) + \mu(I_R)$ so $\mu$ is additive on the kth level sets for all
k, and so $\mu$ extends to a mass distribution with support contained in $\bigcap_{k=0}^{\infty} E_k = F$,
see Proposition 1.7. (We ignore measure theoretic questions connected with
the definition of $\mu$.)

We fix 0 < t < s and estimate the expectation of the t-energy of $\mu$. If $x, y \in F$,
there is a greatest integer k such that x and y belong to a common kth level
interval; denote this interval by $x \wedge y$. If I is a kth level interval, its (k + 1)th
level subintervals $I_L$ and $I_R$ are separated by a gap of at least d|I|, where
d = 1 - 2b. Thus

$$\iint_{x \wedge y = I} |x - y|^{-t}\,d\mu(x)\,d\mu(y) = 2\int_{x \in I_L}\int_{y \in I_R} |x - y|^{-t}\,d\mu(x)\,d\mu(y)$$
$$\leq 2d^{-t}|I|^{-t}\mu(I_L)\mu(I_R).$$

If $I \in E_k$,

$$\mathsf{E}\Big(\iint_{x \wedge y = I} |x - y|^{-t}\,d\mu(x)\,d\mu(y) \,\Big|\, \mathcal{F}_{k+1}\Big) \leq 2d^{-t}|I|^{-t}\,\mathsf{E}(\mu(I_L)|\mathcal{F}_{k+1})\,\mathsf{E}(\mu(I_R)|\mathcal{F}_{k+1})$$
$$\leq 2d^{-t}|I|^{-t}|I_L|^s|I_R|^s$$
$$\leq 2d^{-t}|I|^{2s-t}$$

using (15.7). Using a variation of (1.21) this gives an inequality for the
unconditional expectation

$$\mathsf{E}\Big(\iint_{x \wedge y = I} |x - y|^{-t}\,d\mu(x)\,d\mu(y)\Big) \leq 2d^{-t}\,\mathsf{E}(|I|^{2s-t}).$$

Summing over $I \in E_k$,

$$\mathsf{E}\Big(\sum_{I \in E_k}\iint_{x \wedge y = I} |x - y|^{-t}\,d\mu(x)\,d\mu(y)\Big) \leq 2d^{-t}\,\mathsf{E}\Big(\sum_{I \in E_k} |I|^{2s-t}\Big) = 2d^{-t}\lambda^k$$

where $\lambda = \mathsf{E}(C_1^{2s-t} + C_2^{2s-t}) < 1$, using (15.3) repeatedly. Then

$$\mathsf{E}\Big(\int_F\int_F |x - y|^{-t}\,d\mu(x)\,d\mu(y)\Big) = \mathsf{E}\Big(\sum_{k=0}^{\infty}\sum_{I \in E_k}\iint_{x \wedge y = I} |x - y|^{-t}\,d\mu(x)\,d\mu(y)\Big)$$
$$\leq 2d^{-t}\sum_{k=0}^{\infty}\lambda^k < \infty,$$

so that the t-energy of $\mu$ is finite, with probability 1. As we have noted,
$0 < \mu(F) = \mu([0, 1]) < \infty$ with probability 1, so $\dim_H F \geq t$ by
Theorem 4.13(a). □

Note that the proof of Theorem 15.1 is typical of many random models, in
that the key to both upper and lower estimates is the estimation of the moments
$\mathsf{E}\big(\sum_{I \in E_k} |I|^s\big)$ for certain s.

This theorem and proof generalize in many directions. Each interval in $E_k$
might give rise to a random number of intervals of random lengths in $E_{k+1}$. Of
course, the construction generalizes to $\mathbb{R}^n$, and the separation condition between
different component intervals can be relaxed, provided some sort of 'open set
condition' (see (9.11)) is satisfied. The following construction is a full random
analogue of the sets discussed in Section 9.2.

Let $V$ be an open subset of $\mathbb{R}^n$ with closure $\overline{V}$, let $m \ge 2$ be an integer, and
let $0 < b < 1$. We take $\Omega$ to be the class of all decreasing sequences $\overline{V} = E_0 \supset
E_1 \supset E_2 \supset \cdots$ of closed sets satisfying the following conditions. The set $E_k$ is a
union of the $m^k$ closed sets $\overline{V}_{i_1,\ldots,i_k}$ where $i_j = 1, \ldots, m$ $(1 \le j \le k)$ and $V_{i_1,\ldots,i_k}$
is either similar to $V$ or is the empty set.

We assume that, for each $i_1, \ldots, i_k$, the set $V_{i_1,\ldots,i_k}$ contains $V_{i_1,\ldots,i_k,i}$ $(1 \le i \le m)$
and that these sets are disjoint; this is, essentially, equivalent to the open set
condition. If $V_{i_1,\ldots,i_k}$ is non-empty, we write $C_{i_1,\ldots,i_k} = |V_{i_1,\ldots,i_k}|/|V_{i_1,\ldots,i_{k-1}}|$ for the
similarity ratio between successive sets and we take $C_{i_1,\ldots,i_k} = 0$ if $V_{i_1,\ldots,i_k}$ is the
empty set. We define the random self-similar set $F = \bigcap_{k=0}^{\infty} E_k$.

Let $P$ be a probability measure on a family of subsets of $\Omega$ such that the $C_{i_1,\ldots,i_k}$
are random variables. Suppose that given $C_{i_1,\ldots,i_k} > 0$, i.e. given that $V_{i_1,\ldots,i_k}$ is
non-empty, $C_{i_1,\ldots,i_k,i}$ has identical distribution to $C_i$ for each sequence $i_1, \ldots, i_k$
and for $1 \le i \le m$. We assume that the $C_{i_1,\ldots,i_k}$ are independent, except that,
for each sequence $i_1, \ldots, i_k$, the random variables $C_{i_1,\ldots,i_k,1}, \ldots, C_{i_1,\ldots,i_k,m}$ need
not be independent. This defines a self-similar probability distribution on the
constructions in $\Omega$. We write $N$ for the (random) number of the $C_1, \ldots, C_m$ that
are positive; that is, the number of the sets $V_1, \ldots, V_m$ that are non-empty.

Theorem 15.2 

The random set $F$ described above has probability $q$ of being empty, where $t = q$
is the smallest non-negative solution to the polynomial equation

$$f(t) \equiv \sum_{j=0}^{m} P(N = j)\,t^j = t. \qquad (15.8)$$

With probability $1 - q$ the set $F$ has Hausdorff and box dimensions given by the
solution $s$ of

$$E\left(\sum_{i=1}^{m} C_i^s\right) = 1. \qquad (15.9)$$

*Note on proof. Basically, this is a combination of the probabilistic argument of
Theorem 15.1 and the geometric argument of Theorem 9.3. Note that, if there is
a positive probability that $N = 0$, then there is a positive probability that $E_1 = \emptyset$
and therefore that $F = \emptyset$. This 'extinction' occurs if each of the basic sets in
$E_1$ becomes extinct. By the self-similarity of the process, the probability $q_0$ of
this happening is $f(q_0)$, so $q_0 = f(q_0)$. If $q$ is the least non-negative solution
of $f(q) = q$, then, using that $f$ is increasing, an inductive argument shows
that $P(E_k = \emptyset) = f(P(E_{k-1} = \emptyset)) \le f(q) = q$ for all $k$, so that $q_0 \le q$. Thus
$q_0 = q$.

Observe that $F$ has probability 0 of being empty, i.e. $q = 0$, if and only if $N \ge 1$
with probability 1. It is also not hard to show that $F$ is empty with probability
1, i.e. $q = 1$, if and only if either $E(N) < 1$ or $E(N) = 1$ and $P(N = 1) < 1$.
(These extinction probabilities are closely related to the theory of branching
processes.) □

Example 15.3. Random von Koch curve

Let $C$ be a random variable with uniform distribution on the interval $(0, \frac13)$. Let
$E_0$ be a unit line segment in $\mathbb{R}^2$. We form $E_1$ by removing a proportion $C$ from
the middle of $E_0$ and replacing it by the other two sides of an equilateral triangle
based on the removed interval. We repeat this for each of the four segments in
$E_1$ independently and continue in this way to get a limiting curve $F$. Then with
probability 1, $\dim_H F = \dim_B F = 1.144\ldots$.

Calculation. This is a special case of Theorem 15.2. The set $V$ may be taken as
the isosceles triangle based on $E_0$ and of height $\frac16\sqrt{3}$. At each stage, a segment
of length $L$ is replaced by four segments of lengths $\frac12(1 - C)L$, $CL$, $CL$ and
$\frac12(1 - C)L$, so we have $m = 4$ and $C_1 = C_4 = \frac12(1 - C)$ and $C_2 = C_3 = C$.
Since $C$ is uniformly distributed on $(0, \frac13)$, expression (15.9) becomes

$$1 = E\left(2\left(\tfrac12(1 - C)\right)^s + 2C^s\right) = \int_0^{1/3} 3 \times 2\left[\left(\tfrac12(1 - c)\right)^s + c^s\right]dc$$

or

$$s + 1 = 12 \times 2^{-(s+1)} - 6 \times 3^{-(s+1)}$$

giving the dimension stated. □
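
The dimension in Example 15.3 is defined only implicitly by the equation $s + 1 = 12 \times 2^{-(s+1)} - 6 \times 3^{-(s+1)}$. A minimal sketch of how it might be solved numerically is given below. It uses plain bisection on the interval $[1, 2]$, where the two sides of the equation change order; the function names and the choice of bracket are illustrative assumptions, not part of the text.

```python
def koch_dimension_residual(s: float) -> float:
    """Left side minus right side of  s + 1 = 12 * 2^-(s+1) - 6 * 3^-(s+1)."""
    return (s + 1.0) - (12.0 * 2.0 ** -(s + 1.0) - 6.0 * 3.0 ** -(s + 1.0))


def solve_by_bisection(g, lo: float, hi: float, tol: float = 1e-12) -> float:
    """Find a root of g in [lo, hi], assuming g(lo) and g(hi) have opposite signs."""
    g_lo = g(lo)
    if g_lo * g(hi) > 0:
        raise ValueError("root is not bracketed by [lo, hi]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        g_mid = g(mid)
        if g_lo * g_mid <= 0:
            hi = mid                 # sign change lies in [lo, mid]
        else:
            lo, g_lo = mid, g_mid    # sign change lies in [mid, hi]
    return 0.5 * (lo + hi)


# Assumed bracket: the residual is negative at s = 1 and positive at s = 2.
s_koch = solve_by_bisection(koch_dimension_residual, 1.0, 2.0)
print(f"almost sure dimension of the random von Koch curve: {s_koch:.4f}")
```

The value found agrees with the figure $1.144\ldots$ quoted in the example to the digits shown there.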



15.2 Fractal percolation 

Random fractals can have complicated topological structure, for example they can 
be highly multiply connected. We consider topological aspects of a statistically 
self-similar construction known as 'fractal percolation'. In particular, this exhibits 
a 'phase transition', where the topological structure changes dramatically when 
a continuously varying parameter increases through a critical value. 

Let $p$ be a number with $0 < p < 1$. We divide the unit square $E_0$ into 9
squares of side $\frac13$ in the obvious way. We select a subset of these squares to
form $E_1$ in such a way that each square has independent probability $p$ of being
selected. Similarly, each square of $E_1$ is divided into 9 squares of side $\frac19$, and
each of these has independent probability $p$ of being chosen to be a square of
$E_2$. We continue in this way, so that $E_k$ is a random collection of $k$th level
squares of side $3^{-k}$. This procedure, which depends on the parameter $p$, defines
a random fractal $F_p = \bigcap_{k=0}^{\infty} E_k$; see figures 15.4 and 15.5. (It is not difficult to
describe this construction in precise probabilistic terms; for example, by taking
the possible nested sequences of squares $E_k$ as the sample space.)
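
The construction just described can be sketched in a few lines of code. The following is an illustrative simulation only: the function name, the representation of a level-$k$ square by integer grid indices, and the use of a seeded pseudo-random generator are all assumptions made for the example.

```python
import random


def percolation_levels(p: float, depth: int, seed: int = 0) -> list:
    """Return [E_0, E_1, ..., E_depth] for fractal percolation.

    E_k is represented as a set of (column, row) index pairs; the pair (i, j)
    stands for the closed square [i, i+1] x [j, j+1] scaled by 3**-k.
    """
    rng = random.Random(seed)
    levels = [{(0, 0)}]                       # E_0 is the whole unit square
    for _ in range(depth):
        next_level = set()
        for (i, j) in levels[-1]:
            # Divide the square into 9 subsquares and keep each one
            # independently with probability p.
            for di in range(3):
                for dj in range(3):
                    if rng.random() < p:
                        next_level.add((3 * i + di, 3 * j + dj))
        levels.append(next_level)
    return levels


levels = percolation_levels(p=0.6, depth=4, seed=1)
for k, squares in enumerate(levels):
    print(f"E_{k}: {len(squares)} squares of side 3^-{k}")
```

Each set in the returned list is nested in its predecessor in the sense that the parent `(i // 3, j // 3)` of every retained square was itself retained, mirroring $E_0 \supset E_1 \supset \cdots$.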

Proposition 15.4

Given $p$, let $t = q$ be the least positive solution of the equation

$$t = (pt + 1 - p)^9.$$

Then $F_p$ is empty with probability $q$. If $p \le \frac19$ then $q = 1$. If $\frac19 < p < 1$ then
$0 < q < 1$ and, with probability $1 - q$, $\dim_H F_p = \dim_B F_p = \log 9p/\log 3$.

Proof. Let $N$ be the (random) number of squares in $E_1$. Then
$P(N = j) = \binom{9}{j}p^j(1 - p)^{9-j}$, where $\binom{n}{r} = \dfrac{n!}{r!\,(n-r)!}$ is the binomial coefficient, so the
probability that $F_p = \emptyset$ is, by (15.8), the smallest positive solution of

$$t = \sum_{j=0}^{9}\binom{9}{j}p^j(1 - p)^{9-j}t^j = (pt + 1 - p)^9.$$

Each square of $E_1$ has side $\frac13$, so (15.9) becomes

$$1 = E\left(\sum_i C_i^s\right) = E\left(N3^{-s}\right) = 3^{-s}E(N) = 3^{-s}\,9p$$

(each of the nine squares of side $\frac13$ is selected with probability $p$, so the expected
number chosen is $9p$). Thus by Theorem 15.2, the almost sure dimension of $F_p$,
given it is non-empty, is $\log 9p/\log 3$. □
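
As a numerical illustration of Proposition 15.4, the extinction probability $q$ may be approximated by iterating $t \mapsto (pt + 1 - p)^9$ starting from $t = 0$; the iterates are the probabilities $P(E_k = \emptyset)$ and increase to the least solution. The sketch below assumes this iteration scheme and uses hypothetical function names; convergence is slow when $p$ is close to the critical value $\frac19$.

```python
import math


def extinction_probability(p: float, iterations: int = 10_000) -> float:
    """Approximate the least solution of t = (p*t + 1 - p)**9 by iterating from t = 0."""
    t = 0.0
    for _ in range(iterations):
        t = (p * t + 1.0 - p) ** 9
    return t


def almost_sure_dimension(p: float) -> float:
    """log(9p)/log 3: the dimension of F_p given that it is non-empty (requires p > 1/9)."""
    return math.log(9.0 * p) / math.log(3.0)


for p in (0.05, 0.3, 0.6, 0.8):
    q = extinction_probability(p)
    if p > 1.0 / 9.0:
        print(f"p = {p}: q = {q:.6f}, dimension if non-empty = {almost_sure_dimension(p):.4f}")
    else:
        print(f"p = {p}: q = {q:.6f} (F_p is almost surely empty)")
```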

We now discuss briefly the qualitative way in which the random set $F_p$ changes
as $p$ increases from 0 to 1. We have already noted that $F_p$ is almost surely empty
if $0 < p \le \frac19$. If $\frac19 < p < \frac13$ we have, with probability 1, that either $F_p = \emptyset$ or
$\dim_H F_p = \log 9p/\log 3 < 1$, so by Proposition 2.5 $F_p$ is totally disconnected. At
the other extreme, if $p$ is close to 1, it is plausible that such a high proportion of
the squares are retained at each stage of the construction that $F_p$ will connect the
left and right sides of the square $E_0$; when this happens we say that percolation
occurs between the sides. We show that this is the case at least if $p$ is very close
to 1; the ridiculous bound 0.999 obtained can certainly be reduced considerably.

Figure 15.4 Steps in the construction of the random fractal discussed in Section 15.2
with p = 0.6. The fractal obtained is shown in figure 15.5(a)

Figure 15.5 Random fractals realized by the percolation process discussed in Section 15.2
with (a) p = 0.6 and (b) p = 0.8



Proposition 15.5 

Suppose that $0.999 < p < 1$. Then there is a positive probability (in fact bigger
than 0.9999) that the random fractal $F_p$ joins the left and right sides of $E_0$.

*Proof. The proof depends on the observation that if $I_1$ and $I_2$ are abutting squares
in $E_k$ and both $I_1$ and $I_2$ contain either 8 or 9 subsquares of $E_{k+1}$, then there is
a subsquare in $I_1$ and one in $I_2$ that abut, so that the squares of $E_{k+1}$ in $I_1$ and
$I_2$ form a connected unit.

We say that a square of $E_k$ is full if it contains either 8 or 9 squares of $E_{k+1}$.
We say that a square of $E_k$ is 2-full if it contains 8 or 9 full squares of $E_{k+1}$,
and, inductively, that a square of $E_k$ is $m$-full if it contains either 8 or 9 squares
of $E_{k+1}$ that are $(m - 1)$-full. By the above remark, if $E_0$ is $m$-full, then opposite
sides of $E_0$ are joined by a sequence of abutting squares of $E_m$.

The square $E_0$ is $m$-full $(m \ge 1)$ if either

(a) $E_1$ contains 9 squares all of which are $(m - 1)$-full, or

(b) $E_1$ contains 9 squares of which 8 are $(m - 1)$-full, or

(c) $E_1$ contains 8 squares all of which are $(m - 1)$-full.

Thus, if $p_m$ is the probability that $E_0$ is $m$-full, we get, summing the probabilities
of these three alternatives using (1.16), and using the self-similarity of
the process,

$$p_m = p^9p_{m-1}^9 + p^9\,9p_{m-1}^8(1 - p_{m-1}) + 9p^8(1 - p)p_{m-1}^8 = 9p^8p_{m-1}^8 - 8p^9p_{m-1}^9 \qquad (15.10)$$

if $m \ge 2$. Furthermore, $p_1 = p^9 + 9p^8(1 - p) = 9p^8 - 8p^9$, so we have a discrete
dynamical system defined by $p_m = f(p_{m-1})$ for $m \ge 1$, where $p_0 = 1$ and

$$f(t) = 9p^8t^8 - 8p^9t^9. \qquad (15.11)$$

Suppose that $p = 0.999$. Then (15.11) becomes

$$f(t) = 8.9282515\,t^8 - 7.9282874\,t^9$$

and a little calculation shows that $t_0 = 0.9999613$ is a fixed point of $f$ which
is stable in the sense that $0 \le f(t) - t_0 \le \frac12(t - t_0)$ if $t_0 \le t \le 1$. It follows that
$p_m$ is decreasing and converges to $t_0$ as $m \to \infty$, so there is a probability $t_0 > 0$
that $E_0$ is $m$-full for all $m$. When this happens, opposite sides of $E_0$ are joined
by a sequence of squares in $E_m$ for each $m$, so the intersection $F_p = \bigcap_{k=0}^{\infty} E_k$
joins opposite sides of $E_0$. Thus, there is a positive probability of percolation
occurring if $p = 0.999$, and consequently for larger values of $p$. □
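
The 'little calculation' behind the fixed point can be checked by iterating the map (15.11) directly. The sketch below is an illustration under the stated recursion $p_0 = 1$, $p_m = f(p_{m-1})$; the function name and the number of steps are arbitrary choices.

```python
def m_full_probabilities(p: float, steps: int) -> list:
    """Return [p_0, p_1, ..., p_steps] with p_0 = 1 and p_m = f(p_{m-1}),
    where f(t) = 9 p^8 t^8 - 8 p^9 t^9 as in (15.11)."""
    values = [1.0]
    for _ in range(steps):
        t = values[-1]
        values.append(9.0 * p**8 * t**8 - 8.0 * p**9 * t**9)
    return values


probs = m_full_probabilities(p=0.999, steps=50)
for m in (0, 1, 2, 3, 50):
    print(f"p_{m} = {probs[m]:.7f}")
```

The iterates decrease from $p_0 = 1$ and settle, after only a few steps, at a limit that stays above 0.9999, in line with the bound claimed in Proposition 15.5.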

We have seen that if $0 < p < \frac13$ then, with probability 1, $F_p$ is empty or totally
disconnected. On the other hand, if $p > 0.999$ then there is a high probability of
percolation. The next theorem states that one or other of these situations pertains
for each value of $p$.

Theorem 15.6

There is a critical number $p_c$ with $0.333 < p_c < 0.999$ such that if $0 < p < p_c$
then $F_p$ is totally disconnected with probability 1, but if $p_c < p < 1$ then there
is positive probability that $F_p$ connects the left and right sides of $E_0$.

Idea of proof. Suppose $p$ is such that there is a positive probability of $F_p$ not
being totally disconnected. Then there is positive probability of some two distinct
points of $F_p$ being joined by a path in $F_p$. This implies that there is a positive
probability of the path passing through opposite sides of one of the squares in $E_k$
for some $k$; by virtue of the statistical self-similarity of the construction, there
is a positive probability of a path crossing $E_0$ from left to right. Clearly, if $F_p$
has probability 1 of being totally disconnected, the same is true of $F_{p'}$ if $p' < p$.
Thus the critical probability $p_c$ is the supremum of those $p$ such that $F_p$ is totally
disconnected, and has the properties stated. □

Experiment suggests that 0.7 < p c < 0.8. 

The change in form of $F_p$ as $p$ increases through $p_c$ is even more dramatic
than Theorem 15.6 suggests. Let $F'$ be a random set obtained by tiling the
plane with independent random copies of $F_p$. If $p < p_c$ then, almost surely, $F'$
is totally disconnected. However, if $p \ge p_c$ then, with probability 1, $F'$ has a
single unbounded connected component. Thus as $p$ increases through $p_c$ there is
a 'phase transition' as the isolated points of $F'$ suddenly coalesce to form what
is basically a single unit. The idea underlying the proof of this is that, if $p > p_c$,
then given that parts of $F'$ lie in two disjoint discs of unit radius, there is a
positive probability of them being joined by a path in $F'$. There are infinitely
many such discs in an unbounded component of $F'$, so if $F'$ had two unbounded
components, there would be probability 1 of them being joined.



15.3 Notes and references 

Various versions of the random fractal constructions of the type discussed
in Section 15.1 are presented in Mandelbrot (1974, 1982), Kahane (1974),
Peyrière (1974), Falconer (1986b), Mauldin and Williams (1986a), Graf
(1987), Graf, Mauldin and Williams (1988), Olsen (1994) and Hutchinson and
Rüschendorf (2000).

An interesting construction for fractals by random deletion was introduced by
Mandelbrot (1972); see also Kahane (1985), Zähle (1984) and Falconer (1997).
For other applications of probability to fractals, see Stoyan and Stoyan (1994)
and Falconer (1997).

The fractal percolation model was suggested by Mandelbrot (1974). Chayes, 
Chayes and Durrett (1988) analysed the phase transition in detail, and Dekking 
and Meester (1990) highlighted other phase transitions of the model. Chayes 
(1995) gives a clear survey on fractal percolation and its generalizations. 

Much has been written on 'discrete' percolation, where points are selected at 
random from a large square mesh (see Kesten (1982) and Grimmett (1999)), and 
there are many relationships between this and the fractal case. 



Exercises 

15.1 Find the almost sure Hausdorff dimension of the random Cantor set constructed
by removing the middle third of each interval with probability $\frac12$ and the middle
two-thirds of the interval with probability $\frac12$ at each step of the construction.

15.2 Consider the following random version of the von Koch construction. We start with
a unit segment. With probability $\frac12$ we replace the middle third of the segment by the
other two sides of the (upwards pointing) equilateral triangle, and with probability $\frac12$
we remove the middle third altogether. We repeat this procedure with the segments
that remain, in the usual way. Show that, with probability 1, this random fractal has
Hausdorff dimension 1.

15.3 Show that the random von Koch curve depicted in figure 15.1 always has Hausdorff 
dimension s = log 4/ log 3 and, indeed, is an s-set. (This is not typical of random 
constructions.) 

15.4 Let $0 < p < 1$. We may randomize the construction of the Sierpinski gasket
(figure 0.3) by selecting each of the three equilateral subtriangles independently
with probability $p$ at each step. (Thus we have a percolation process based on
the Sierpinski gasket.) Show that if $p \le \frac13$ then the limiting set $F$ is empty with
probability 1, but if $\frac13 < p < 1$ then there is a positive probability that $F$ is non-empty.
Find an expression for this probability, and show that, given $F$ is non-empty,
$\dim_H F = \log 3p/\log 2$ with probability 1.

15.5 For the random Sierpinski gasket described in Exercise 15.4 show that F is totally 
disconnected with probability 1 for every p < 1. (We regard two triangles as being 
joined if they touch at a vertex.) 

15.6 Consider the random Cantor set analysed in Theorem 15.1. With $\mathcal{H}^s_\infty(F)$ denoting
the infimum of the sums in (2.1) over arbitrary coverings of $F$, show that

$$\mathcal{H}^s_\infty(F) = \min\{1, \mathcal{H}^s_\infty(F \cap I_1) + \mathcal{H}^s_\infty(F \cap I_2)\}$$

where $s$ is the solution of (15.1). Use statistical self-similarity to deduce that, unless
$P(C_1^s + C_2^s = 1) = 1$, then, almost surely, $\mathcal{H}^s_\infty(F) = 0$, and thus $\mathcal{H}^s(F) = 0$.

15.7 Show that the martingale $X_k$ given by (15.5) is $L^2$-bounded, that is $E(X_k^2) \le m$ for
all $k$, for some number $m$. (Hint: Show that $E(X_{k+1}^2 \mid \mathcal{F}_k) \le X_k^2 + a\gamma^k$ where $\gamma =
E(C_1^{2s} + C_2^{2s}) < 1$ and $a$ is a constant, and then take unconditional expectations.)

15.8 Consider the fractal percolation model $F$ constructed with selection probability
$p$, where $p_c < p < 1$. Show that with probability 1, either $F = \emptyset$ or $F$ contains
infinitely many non-trivial connected components. (Here non-trivial means containing
at least two distinct points.)



Chapter 16 Brownian motion and 
Brownian surfaces 



In 1827 the botanist R. Brown noticed that minute particles suspended in a liquid 
moved on highly irregular trails. This, and a similar phenomenon for smoke par- 
ticles in air, was explained ultimately as resulting from molecular bombardment 
of the particles. Einstein published a mathematical study of this motion, which 
eventually led to Perrin's Nobel Prize-winning calculation of Avogadro's number. 

In 1923 Wiener proposed a rigorous mathematical model that exhibited random 
behaviour similar to that observed in Brownian motion. The paths described by 
this 'Wiener process' in 3-dimensional space are so irregular as to have Hausdorff 
dimension equal to 2. This is a good example of a natural phenomenon with a 
fractal appearance that can be explained by a simple mathematical model. 

A trail may be described by a function $f : \mathbb{R} \to \mathbb{R}^n$ where $f(t)$ is the position
of a particle at time $t$. We can study $f$ from two differing viewpoints. Either
we can think of the trail or image $f([t_1, t_2]) = \{f(t) : t_1 \le t \le t_2\}$ as a subset
of $\mathbb{R}^n$ with $t$ regarded merely as a parameter, or we can consider the graph of
$f$, graph $f = \{(t, f(t)) : t_1 \le t \le t_2\} \subset \mathbb{R}^{n+1}$, as a record of the variation of $f$
with time. Brownian trails and their graphs are, in general, fractals.

In this chapter, our aim is to define a probability measure on a space of func- 
tions, such that the trails likely to occur resemble observed Brownian motion. 
We begin by investigating the fractal form of classical Brownian motion, and 
then we examine some variants that have been used to model a wide variety 
of phenomena, including polymer chains, stock market prices and topographi- 
cal surfaces. 

16.1 Brownian motion 

We first define Brownian motion in one dimension, and then extend the definition 
to the higher-dimensional cases. 

To motivate the definition, let us consider a particle performing a random walk
on the real line. Suppose at small time intervals $\tau$ the particle jumps a small
distance $\delta$, randomly to the left or to the right. (This might be a reasonable
description of a particle undergoing random molecular bombardment in one dimension.)
Let $X_\tau(t)$ denote the position of the particle at time $t$. Then, given the position
$X_\tau(k\tau)$ at time $k\tau$, the position $X_\tau((k + 1)\tau)$ at time $(k + 1)\tau$ is equally likely
to be $X_\tau(k\tau) + \delta$ or $X_\tau(k\tau) - \delta$. Assuming that the particle starts at the origin at
time 0, then for $t > 0$, the position at time $t$ is described by the random variable

$$X_\tau(t) = \delta(Y_1 + \cdots + Y_{\lfloor t/\tau \rfloor})$$

where $Y_1, Y_2, \ldots$ are independent random variables, each having probability $\frac12$
of equalling 1 and probability $\frac12$ of equalling $-1$. Here $\lfloor t/\tau \rfloor$ denotes the largest
integer less than or equal to $t/\tau$. We normalize the step length by setting $\delta = \sqrt{\tau}$
so that

$$X_\tau(t) = \sqrt{\tau}\,(Y_1 + \cdots + Y_{\lfloor t/\tau \rfloor}). \qquad (16.1)$$

The central limit theorem (see (1.26)) tells us that, for fixed $t$, if $\tau$ is small
then the distribution of the random variable $X_\tau(t)$ is approximately normal with
mean 0 and variance $\tau\lfloor t/\tau \rfloor \simeq t$, since the $Y_i$ have mean 0 and variance 1. In the
same way, if $t$ and $h$ are fixed, and $\tau$ is sufficiently small, then $X_\tau(t + h) - X_\tau(t)$
is approximately normal with mean 0 and variance $h$. We also note that,
if $0 \le t_1 \le t_2 \le \cdots \le t_{2m}$, then the increments $X_\tau(t_2) - X_\tau(t_1)$, $X_\tau(t_4) -
X_\tau(t_3), \ldots, X_\tau(t_{2m}) - X_\tau(t_{2m-1})$ are independent random variables. We define
Brownian motion with the limit of the random walk $X_\tau(t)$ as $\tau \to 0$ in mind.

Let $(\Omega, \mathcal{F}, P)$ be a probability space. For our purposes we call $X$ a random
process or random function from $[0, \infty)$ to $\mathbb{R}$ if $X(t)$ is a random variable for
each $t$ with $0 \le t < \infty$. Sometimes, we consider random functions on a finite
interval $[t_1, t_2]$ instead, in which case the development is similar. (In the formal
definition of a random process there is an additional 'measurability' condition,
which need not concern us here.) Of course, we think of $X$ as defining a sample
function $t \mapsto X(\omega, t)$ for each point $\omega$ in the sample space $\Omega$. Thus we think of
the points of $\Omega$ as parametrizing the functions $X : [0, \infty) \to \mathbb{R}$, and we think of
$P$ as a probability measure on this class of functions.

We define Brownian motion or the Wiener process to be a random process $X$
such that:

(BM) (i) with probability 1, $X(0) = 0$ (i.e. the process starts at the origin)
and $X(t)$ is a continuous function of $t$;
(ii) for all $t \ge 0$ and $h > 0$ the increment $X(t + h) - X(t)$ is normally
distributed with mean 0 and variance $h$, thus

$$P(X(t + h) - X(t) \le x) = (2\pi h)^{-1/2}\int_{-\infty}^{x}\exp\left(\frac{-u^2}{2h}\right)du; \qquad (16.2)$$

(iii) if $0 \le t_1 \le t_2 \le \cdots \le t_{2m}$, the increments $X(t_2) - X(t_1)$, $X(t_4) -
X(t_3), \ldots, X(t_{2m}) - X(t_{2m-1})$ are independent.

Note that it is immediate from (i) and (ii) that $X(t)$ is itself normally distributed
with mean 0 and variance $t$ for each $t$. Observe that the increments of $X$ are
stationary; that is, $X(t + h) - X(t)$ has distribution independent of $t$.

The first question that arises is whether there actually is a random function
satisfying the conditions (BM). It is quite hard to show that Brownian motion
does exist, and we do not do so here. The proof uses the special properties
of the normal distribution. For example, given that $X(t_2) - X(t_1)$ and $X(t_3) -
X(t_2)$ are independent and normal with means 0 and variances $t_2 - t_1$ and $t_3 -
t_2$ respectively, the sum $X(t_3) - X(t_1)$ is necessarily normal with mean 0 and
variance $t_3 - t_1$; see (1.24) et seq. This is essential for the definition (BM) to be
self-consistent. It should at least seem plausible that a continuous process $X(t)$
satisfying (BM) exists, if only as a limit of the random walks $X_\tau(t)$ as $\tau \to 0$.

Instead of proving existence, we mention three methods of simulating Brownian
sample functions, for example with a computer. Each method can, in fact,
be used as a basis for existence proofs. The first method uses the random walk
approximation (16.1). Values of 1 or $-1$ are assigned by 'coin tossing' to $Y_i$
for $1 \le i \le m$, where $m$ is large, and $X_\tau(t)$ is plotted accordingly. If $\tau$ is small
compared with $t$, then this gives an accurate approximation to a Brownian sample
function.
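
The random walk method might be coded along the following lines. This is a sketch only: the function name, the seeded generator standing in for 'coin tossing', and the sampling of the path at the times $k\tau$ are assumptions made for illustration.

```python
import math
import random


def random_walk_path(tau: float, t_max: float, seed: int = 0) -> list:
    """Positions X_tau(k * tau) for k = 0, 1, ..., floor(t_max / tau), following (16.1):
    each step is +sqrt(tau) or -sqrt(tau), chosen with probability 1/2 each."""
    rng = random.Random(seed)
    step = math.sqrt(tau)
    n_steps = int(math.floor(t_max / tau))
    path = [0.0]                                   # the walk starts at the origin
    for _ in range(n_steps):
        path.append(path[-1] + (step if rng.random() < 0.5 else -step))
    return path


# One approximate sample function on [0, 1] with 4096 steps.
path = random_walk_path(tau=1.0 / 4096, t_max=1.0, seed=42)
print(f"{len(path) - 1} steps, X(1) is approximately {path[-1]:.4f}")
```

Repeating the simulation over many seeds and examining the endpoints gives an empirical check that $X_\tau(1)$ has variance close to 1, as the central limit theorem argument suggests.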

The 'random midpoint displacement' method may be used to obtain a sample
function $X : [0, 1] \to \mathbb{R}$. We define the values of $X(k2^{-j})$ where $0 \le k \le 2^j$ by
induction on $j$. We set $X(0) = 0$ and choose $X(1)$ at random from a normal
distribution with mean 0 and variance 1. Next we select $X(\frac12)$ from a normal
distribution with mean $\frac12(X(0) + X(1))$ and variance $\frac14$. At the next step $X(\frac14)$
and $X(\frac34)$ are chosen, and so on. At the $j$th stage the values $X(k2^{-j})$ for odd $k$
are chosen independently from a normal distribution with mean $\frac12(X((k - 1)2^{-j})
+ X((k + 1)2^{-j}))$ and variance $2^{-(j+1)}$, which is the variance of a Brownian increment
conditioned on the values at the two neighbouring points a distance $2^{-(j-1)}$ apart.
This procedure determines $X(t)$ at all binary points $t = k2^{-j}$. Assuming that $X$
is continuous, then $X$ is completely determined. It may be shown, using properties
of normal distributions, that the functions thus generated have the distribution
given by (BM).
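
A short sketch of random midpoint displacement follows. It is illustrative rather than definitive: the function name and array layout are assumptions, and the displacement at stage $j$ is drawn with variance $2^{-(j+1)}$, the conditional variance of Brownian motion at the midpoint of an interval of length $2^{-(j-1)}$ given its values at the endpoints.

```python
import random


def midpoint_displacement(levels: int, seed: int = 0) -> list:
    """Return the values X(k * 2**-levels), k = 0, ..., 2**levels, of a sample function on [0, 1]."""
    rng = random.Random(seed)
    n = 2 ** levels
    x = [0.0] * (n + 1)                      # x[k] holds X(k / n); X(0) = 0
    x[n] = rng.gauss(0.0, 1.0)               # X(1) is normal with mean 0, variance 1
    for j in range(1, levels + 1):
        gap = n // 2 ** j                    # index spacing of the points k * 2**-j
        sd = (2.0 ** -(j + 1)) ** 0.5        # conditional standard deviation at stage j
        for k in range(1, 2 ** j, 2):        # odd k: the midpoints new at this stage
            left, right = x[(k - 1) * gap], x[(k + 1) * gap]
            x[k * gap] = 0.5 * (left + right) + rng.gauss(0.0, sd)
    return x


values = midpoint_displacement(levels=10, seed=7)
print(f"{len(values)} points, X(1/2) = {values[len(values) // 2]:.4f}, X(1) = {values[-1]:.4f}")
```

One attraction of this scheme is that refining the resolution never changes values already chosen, so a plot can be sharpened stage by stage.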

Brownian motion on $[0, \pi]$ has a Fourier series representation

$$X(t) = \frac{1}{\sqrt{\pi}}\,C_0t + \sqrt{\frac{2}{\pi}}\sum_{k=1}^{\infty} C_k\frac{\sin kt}{k}$$

where the $C_k$ have independent normal distributions of mean 0 and variance 1.
Choosing the $C_k$ at random and curtailing the series gives an approximation to
a sample function. The graph of a Brownian sample function is in figure 16.1.
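
The curtailed Fourier series can be evaluated directly, as in the sketch below. The function names, the default number of terms and the seeded generator are illustrative assumptions. The second function computes the variance of the truncated sum at a fixed time $t$, namely $t^2/\pi + (2/\pi)\sum_{k \le n} \sin^2(kt)/k^2$, which should approach $t$ as more terms are taken.

```python
import math
import random


def fourier_brownian(ts, n_terms: int = 500, seed: int = 0) -> list:
    """Evaluate the series, truncated after n_terms sine terms, at each t in ts (0 <= t <= pi)."""
    rng = random.Random(seed)
    c = [rng.gauss(0.0, 1.0) for _ in range(n_terms + 1)]      # C_0, C_1, ..., C_n
    out = []
    for t in ts:
        series = sum(c[k] * math.sin(k * t) / k for k in range(1, n_terms + 1))
        out.append(c[0] * t / math.sqrt(math.pi) + math.sqrt(2.0 / math.pi) * series)
    return out


def truncated_variance(t: float, n_terms: int) -> float:
    """Variance of the truncated series at time t; it tends to t as n_terms grows."""
    tail = sum(math.sin(k * t) ** 2 / k**2 for k in range(1, n_terms + 1))
    return t * t / math.pi + (2.0 / math.pi) * tail


times = [math.pi * i / 200 for i in range(201)]
sample = fourier_brownian(times, n_terms=500, seed=3)
print(f"X(pi) = {sample[-1]:.4f}; variance at t = 1 with 500 terms: {truncated_variance(1.0, 500):.4f}")
```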

It is easy to extend the definition of Brownian motion from $\mathbb{R}$ to $\mathbb{R}^n$: we just
define Brownian motion on $\mathbb{R}^n$ so that the coordinate components are independent
1-dimensional Brownian motions. Thus $X : [0, \infty) \to \mathbb{R}^n$ given by $X(t) =
(X_1(t), \ldots, X_n(t))$ is an $n$-dimensional Brownian motion on some probability
space if the random process $X_i(t)$ is a 1-dimensional Brownian motion for each
$i$, and $X_1(t_1), \ldots, X_n(t_n)$ are independent for all sets of times $t_1, \ldots, t_n$. A sample
trail of Brownian motion in $\mathbb{R}^2$ is shown in figure 16.2.

Figure 16.1 Graph of a realization of Brownian motion

By definition, the projection of $X(t)$ onto each of the coordinate axes is a
1-dimensional Brownian motion. However, the coordinate axes are not special
in this respect: $n$-dimensional Brownian motion is isotropic; that is, it has the
same characteristics in every direction. To see this, consider, for convenience, the
case of 2-dimensional Brownian motion $X(t) = (X_1(t), X_2(t))$. The projection
of $X(t)$ onto the line $L_\theta$ at angle $\theta$ through the origin is $X_1(t)\cos\theta + X_2(t)\sin\theta$.
For $t \ge 0$ and $h > 0$ the random variables $X_1(t + h) - X_1(t)$ and $X_2(t + h) -
X_2(t)$ are independent and normally distributed with means 0 and variances $h$.
Thus the increments of the projection onto $L_\theta$, given by

$$(X_1(t + h) - X_1(t))\cos\theta + (X_2(t + h) - X_2(t))\sin\theta,$$

are normally distributed with mean 0 and variance $h\cos^2\theta + h\sin^2\theta = h$; see
(1.24) et seq. In a similar way, the increments of the projection are independent,
so the projection of $X(t)$ onto $L_\theta$ is 1-dimensional Brownian motion, for all
angles $\theta$.




Figure 16.2 A simulation of a Brownian trail in $\mathbb{R}^2$

If $\gamma > 0$, replacing $h$ by $\gamma h$ and $x$ by $\gamma^{1/2}x$ does not alter the value of the
right-hand side of (16.2) (by substituting $u_1 = u\gamma^{-1/2}$ in the integral). Thus

$$P(X_i(t + h) - X_i(t) \le x_i) = P(X_i(\gamma t + \gamma h) - X_i(\gamma t) \le \gamma^{1/2}x_i)$$

for all $x_i$. It follows that $X(t)$ and $\gamma^{-1/2}X(\gamma t)$ have the same distribution. Thus
changing the temporal scale by a factor $\gamma$ and the spatial scale by a factor $\gamma^{1/2}$
gives a process indistinguishable from the original so we term Brownian motion
$X(t)$ and its graph statistically self-affine. But we call Brownian trails statistically
self-similar, in that the spatial distribution of the trails $\{X(t) : 0 \le t \le T\}$ and
$\{X(t) : 0 \le t \le \gamma T\}$ are indistinguishable except for a scale factor of $\gamma^{1/2}$.

Suppose that $X(t) = (X_1(t), \ldots, X_n(t))$ is $n$-dimensional Brownian motion.
Since $X_i(t + h) - X_i(t)$ has independent normal distribution for each $i$, it follows
from (16.2) that if $[a_i, b_i]$ are intervals, then

$$P(X_i(t + h) - X_i(t) \in [a_i, b_i]) = (2\pi h)^{-1/2}\int_{a_i}^{b_i}\exp\left(-\frac{x_i^2}{2h}\right)dx_i.$$

Hence if $E$ is the coordinate parallelepiped $[a_1, b_1] \times \cdots \times [a_n, b_n]$,

$$P(X(t + h) - X(t) \in E) = \prod_{i=1}^{n}\left[(2\pi h)^{-1/2}\int_{a_i}^{b_i}\exp\left(-\frac{x_i^2}{2h}\right)dx_i\right] = (2\pi h)^{-n/2}\int_E\exp\left(-\frac{|x|^2}{2h}\right)dx \qquad (16.3)$$

where $x = (x_1, \ldots, x_n)$. By approximating sets by unions of such parallelepipeds,
it follows that (16.3) holds for any Borel set $E$. Thus, taking $E$ as the ball $B(0, \rho)$,
and converting into polar coordinates,

$$P(|X(t + h) - X(t)| \le \rho) = ch^{-n/2}\int_{r=0}^{\rho} r^{n-1}\exp\left(-\frac{r^2}{2h}\right)dr \qquad (16.4)$$

where $c = (2\pi)^{-n/2}a_n$ with $a_n$ the $(n - 1)$-dimensional area of the surface of
$B(0, 1)$.

A fundamental property of a Brownian motion is that, with probability 1, the
sample functions satisfy a Hölder condition of exponent $\lambda$ for each $\lambda < \frac12$.

Proposition 16.1 

Suppose $0 < \lambda < \frac12$. With probability 1 the Brownian sample function
$X : [0, 1] \to \mathbb{R}^n$ satisfies

$$|X(t + h) - X(t)| \le b|h|^\lambda \qquad (|h| < H_0) \qquad (16.5)$$

for some $H_0 > 0$, where $b$ depends only on $\lambda$.

Proof. For $h > 0$ we have, by (16.4),

$$P(|X(t + h) - X(t)| > h^\lambda) = ch^{-n/2}\int_{h^\lambda}^{\infty} r^{n-1}\exp\left(-\frac{r^2}{2h}\right)dr$$
$$= c\int_{h^{\lambda-1/2}}^{\infty} u^{n-1}\exp\left(-\frac{u^2}{2}\right)du$$
$$\le c_1\int_{h^{\lambda-1/2}}^{\infty}\exp(-u)\,du$$
$$= c_1\exp(-h^{\lambda-1/2})$$
$$\le c_2h^2 \qquad (16.6)$$

after a substitution $u = rh^{-1/2}$ and some sweeping estimates, where $c_1$ and $c_2$ do
not depend on $h$ or $t$. Taking $[t, t + h]$ as the binary intervals $[(m - 1)2^{-j}, m2^{-j}]$
we have

$$P\left(|X((m - 1)2^{-j}) - X(m2^{-j})| > 2^{-j\lambda} \text{ for some } j \ge k \text{ and } 1 \le m \le 2^j\right) \le c_2\sum_{j=k}^{\infty} 2^j2^{-2j} = c_22^{-k+1}.$$

Thus with probability 1 there is an integer $K$ such that

$$|X((m - 1)2^{-j}) - X(m2^{-j})| \le 2^{-j\lambda} \qquad (16.7)$$

for all $j > K$ and $1 \le m \le 2^j$. If $h < H_0 = 2^{-K}$ the interval $[t, t + h]$ may,
except possibly for the endpoints, be expressed as a countable union of contiguous
binary intervals of the form $[(m - 1)2^{-j}, m2^{-j}]$ with $2^{-j} \le h$ and with no more
than two intervals of any one length. (Take all the binary intervals in $[t, t + h]$
not contained in any other such intervals.) Then, using the continuity of $X$, if $k$
is the least integer with $2^{-k} \le h$,

$$|X(t) - X(t + h)| \le 2\sum_{j=k}^{\infty} 2^{-j\lambda} = \frac{2 \cdot 2^{-k\lambda}}{1 - 2^{-\lambda}} \le \frac{2h^\lambda}{1 - 2^{-\lambda}}.$$

□

Theorem 16.2 

With probability 1, a Brownian trail in $\mathbb{R}^n$ $(n \ge 2)$ has Hausdorff dimension and
box dimension equal to 2.

Proof. For every $\lambda < \frac12$, $X : [0, 1] \to \mathbb{R}^n$ satisfies a Hölder condition (16.5) with
probability 1, so by Proposition 2.3, $\dim_H X([0, 1]) \le (1/\lambda)\dim_H [0, 1] \le 1/\lambda$,
with a similar inequality for box dimensions. Thus, almost surely, Brownian
trails have dimension at most 2.

For the lower bound we use the potential theoretic method. Take $1 < s < 2$.
For given $t$ and $h$ let $p(\rho)$ denote the expression in (16.4). Taking the expectation,
it follows that

$$E(|X(t + h) - X(t)|^{-s}) = \int_0^{\infty} r^{-s}\,dp(r)$$
$$= ch^{-n/2}\int_0^{\infty} r^{-s+n-1}\exp\left(-\frac{r^2}{2h}\right)dr$$
$$= \tfrac12ch^{-s/2}\int_0^{\infty} w^{(n-s-2)/2}\exp\left(-\frac{w}{2}\right)dw$$
$$= c_1h^{-s/2} \qquad (16.8)$$

after substituting $w = r^2/h$, where $c_1$ is independent of $h$ and $t$. Then

$$E\left(\int_0^1\int_0^1 |X(t) - X(u)|^{-s}\,dt\,du\right) = \int_0^1\int_0^1 E(|X(t) - X(u)|^{-s})\,dt\,du$$
$$= \int_0^1\int_0^1 c_1|t - u|^{-s/2}\,dt\,du$$
$$< \infty \qquad (16.9)$$

since $s < 2$. There is a natural way of defining a mass distribution $\mu_f$ on a
trail $f$, with the mass of a set equal to the time the trail spends in the set,
i.e. $\mu_f(A) = \mathcal{L}\{t : 0 \le t \le 1 \text{ and } f(t) \in A\}$ where $\mathcal{L}$ is Lebesgue measure. Then
$\int g(x)\,d\mu_f(x) = \int_0^1 g(f(t))\,dt$ for any function $g$, so (16.9) becomes

$$E\left(\iint |x - y|^{-s}\,d\mu_X(x)\,d\mu_X(y)\right) < \infty.$$

Hence if $s < 2$ then $\iint |x - y|^{-s}\,d\mu_X(x)\,d\mu_X(y) < \infty$ with probability 1, where
$\mu_X$ is a mass distribution on the trail $X([0, 1])$, so $\dim_H X([0, 1]) \ge s$ by Theorem 4.13(a).

□

In fact, with probability 1, Brownian trails in $\mathbb{R}^n$ $(n \ge 2)$ have 2-dimensional
Hausdorff measure 0. More delicate arguments involving the finer definitions
of dimension given in Section 2.5 show that, with probability 1, the trails
$X([0, 1])$ have positive finite measure with respect to the function $h(t) =
t^2\log(1/t)\log\log\log(1/t)$, if $n = 2$, and with respect to $h(t) = t^2\log\log(1/t)$,
if $n \ge 3$. In this sense, Brownian trails have a dimension that is 'logarithmically
smaller' than 2.

An obvious qualitative question about Brownian trails is whether they are
simple curves, or whether they are self-intersecting. Given a function $f$, we
say that $x$ is a point of multiplicity $k$ if $f(t) = x$ for $k$ distinct values of $t$.
Dimensional methods may be used to determine whether Brownian functions
have multiple points.

Theorem 16.3 

With probability 1, a Brownian sample function $X : [0, \infty) \to \mathbb{R}^n$ has multiple
points as follows:

$n = 2$: there are points of multiplicity $k$ for every positive integer $k$;

$n = 3$: there are double points but no triple points;

$n \ge 4$: there are no multiple points.

Idea of proof. One approach is to use the intersection theorems of Chapter 8.
For the case $n = 3$, suppose that $\dim_H(X([0, 1]) \cap X([2, 3])) < 1$ with probability 1.
Using isotropy and scaling of Brownian motion it is not difficult
to see that this implies that $\dim_H(X([0, 1]) \cap \sigma(X([2, 3]))) < 1$ with probability 1
for any similarity transformation $\sigma$. It follows that, with probability
1, $\dim_H(X([0, 1]) \cap \sigma(X([2, 3]))) < 1$ for almost all similarities $\sigma$. Since, by
Theorem 16.2, $\dim_H X([0, 1]) = \dim_B X([0, 1]) = 2$ with probability 1, this contradicts
Theorem 8.2(a), and we conclude that $\dim_H(X([0, 1]) \cap X([2, 3])) = 1$
with positive probability $p$, say. Using the statistical self-similarity of $X(t)$ it
follows that $\dim_H(X([t, t + \delta]) \cap X([t + 2\delta, t + 3\delta])) = 1$ with probability $p$ for
every $t$ and $\delta$, so, since the increments are independent, the set of double points
has Hausdorff dimension 1 with probability 1.

Similar techniques may be used to prove the other results. □ 

The derivation of the almost sure dimension of Brownian graphs is similar to 
that for Brownian trails. 

Theorem 16.4 

With probability 1, the graph of a Brownian sample function $X : [0, 1] \to \mathbb{R}$ has
Hausdorff and box dimension $1\frac12$.

Proof. From the Hölder condition (16.5) and Corollary 11.2(a) it is clear that,
with probability 1, graph $X$ has Hausdorff dimension and upper box dimension
at most $2 - \lambda$ for every $\lambda < \frac12$, so has dimensions at most $1\frac12$. For the lower
estimate, as in the proof of Theorem 16.2,

$$E\left((|X(t + h) - X(t)|^2 + h^2)^{-s/2}\right) = \int_0^{\infty} (r^2 + h^2)^{-s/2}\,dp(r)$$
$$= ch^{-1/2}\int_0^{\infty} (r^2 + h^2)^{-s/2}\exp\left(-\frac{r^2}{2h}\right)dr$$
$$= \tfrac12c\int_0^{\infty} (wh + h^2)^{-s/2}w^{-1/2}\exp\left(-\frac{w}{2}\right)dw$$
$$\le \tfrac12c\int_0^{h} (h^2)^{-s/2}w^{-1/2}\,dw + \tfrac12c\int_h^{\infty} (wh)^{-s/2}w^{-1/2}\,dw$$
$$\le c_1h^{1/2-s}$$

on splitting the range of integration and estimating the integral in two ways.
We may lift Lebesgue measure from the $t$-axis to get a mass distribution $\mu_f$ on
the graph of a function $f$ given by $\mu_f(A) = \mathcal{L}\{t : 0 \le t \le 1 \text{ and } (t, f(t)) \in A\}$.
Using Pythagoras' Theorem,

$$E\left(\iint |x - y|^{-s}\,d\mu_X(x)\,d\mu_X(y)\right) = \int_0^1\int_0^1 E\left((|X(t) - X(u)|^2 + |t - u|^2)^{-s/2}\right)dt\,du$$
$$\le \int_0^1\int_0^1 c_1|t - u|^{1/2-s}\,dt\,du$$
$$< \infty$$

if $s < 1\frac12$. With probability 1, the mass distribution $\mu_X$ on graph $X$ is positive
and finite and has finite $s$-energy, so Theorem 4.13(a) gives $\dim_H \operatorname{graph} X \ge s$
for every $s < 1\frac12$, and hence $\dim_H \operatorname{graph} X \ge 1\frac12$. □

Since, with probability 1, the graph of $X$ over any interval has dimension $1\frac{1}{2}$, it is immediate that Brownian functions, though continuous, are not continuously differentiable. In fact, with probability 1, a Brownian function is nowhere differentiable.

As with Brownian trails, Brownian graphs have dimension logarithmically smaller than $1\frac{1}{2}$; with probability 1, the graph of $X$ over the range $[0,1]$ has positive finite measure with respect to the function $h(t) = t^{3/2}\log\log(1/t)$.

The sets of times at which a Brownian sample function takes particular values are often of interest. If $f : [0,1] \to \mathbb{R}$ is a function, we define the level sets $f^{-1}(c) = \{t : f(t) = c\}$ for each value of $c$. The level sets are, essentially, the intersections of the graph of $f$ with lines parallel to the $t$-axis.

Proposition 16.5 

With probability 1, a Brownian sample function $X : [0,1] \to \mathbb{R}$ satisfies $\dim_H X^{-1}(c) \le \frac{1}{2}$ for almost all $c$ (in the sense of 1-dimensional Lebesgue measure). Moreover, for any given $c$, $\dim_H X^{-1}(c) = \frac{1}{2}$ with positive probability.

Note on proof. With probability 1, we must have $\dim_H X^{-1}(c) = \dim_H((\operatorname{graph} X) \cap L_c) \le \frac{1}{2}$ for almost all $c$, where $L_c$ is the line $y = c$; otherwise Corollary 7.10 would imply that $\dim_H \operatorname{graph} X > 1\frac{1}{2}$, contradicting Theorem 16.4.

It is much harder to show that $\dim_H X^{-1}(c) = \frac{1}{2}$ with positive probability. The argument is not unlike that indicated for the proof of Theorem 8.2. □



16.2 Fractional Brownian motion 

Brownian motion, although of central theoretical importance, is, for many purposes, too restrictive. The graph of a Brownian sample function has dimension $1\frac{1}{2}$ almost surely but random functions with graphs of other dimensions are required for a variety of modelling purposes.

Brownian motion $X : [0,\infty) \to \mathbb{R}$ is a Gaussian process. This means that for $0 \le t_1 \le t_2 \le \cdots \le t_m$ and scalars $\lambda_1, \ldots, \lambda_m$, the random variable $\lambda_1 X(t_1) + \cdots + \lambda_m X(t_m)$ is normal (we say that $(X(t_1), \ldots, X(t_m))$ is multivariate normal). In fact Brownian motion is the unique probability distribution on functions which has independent increments that are stationary and of finite variance. To obtain sample functions with different characteristics it is necessary to relax one or more of these conditions.

There are two usual variations. Fractional Brownian motion has increments which are normally distributed but no longer independent. Lévy processes, on the other hand, dispense with the normal distribution condition and this can lead to discontinuous functions. For simplicity, we just discuss these processes in the 1-dimensional case; analogous processes may be defined taking values in $n$-dimensional space.

Fractional Brownian motion of index-$\alpha$ ($0 < \alpha < 1$) is defined to be a Gaussian process $X : [0,\infty) \to \mathbb{R}$ on some probability space such that:

(FBM) (i) with probability 1, $X(t)$ is continuous and $X(0) = 0$;

(ii) for every $t \ge 0$ and $h > 0$ the increment $X(t+h) - X(t)$ has the normal distribution with mean zero and variance $h^{2\alpha}$, so that

\[ P(X(t+h) - X(t) \le x) = (2\pi)^{-1/2}\,h^{-\alpha}\int_{-\infty}^{x} \exp\big(-u^2/2h^{2\alpha}\big)\,du. \tag{16.10} \]

It may be shown that, for $0 < \alpha < 1$, a process satisfying (FBM) exists. Figure 16.3 shows graphs of fractional Brownian motion for various $\alpha$.

It is implicit in the above definition that the increments $X(t+h) - X(t)$ are stationary; that is, they have probability distribution independent of $t$. However, the distribution of functions specified by (FBM) cannot have independent increments except in the Brownian case of $\alpha = \frac{1}{2}$. By conditions (i) and (ii), $E(X(t)^2) = t^{2\alpha}$ and

\[ E\big((X(t+h) - X(t))^2\big) = h^{2\alpha} \]

from which

\[ E(X(t)X(t+h)) = \tfrac{1}{2}\big[t^{2\alpha} + (t+h)^{2\alpha} - h^{2\alpha}\big] \]

so that

\[ E\big(X(t)(X(t+h) - X(t))\big) = \tfrac{1}{2}\big[(t+h)^{2\alpha} - t^{2\alpha} - h^{2\alpha}\big] \tag{16.11} \]

which is non-zero if $\alpha \ne \frac{1}{2}$. Hence $E((X(t) - X(0))(X(t+h) - X(t)))$ is positive or negative according to whether $\alpha > \frac{1}{2}$ or $\alpha < \frac{1}{2}$. Thus, the increments are not independent: if $\alpha > \frac{1}{2}$ then $X(t) - X(0)$ and $X(t+h) - X(t)$ tend to be of the same sign, so that $X(t)$ tends to increase in the future if it has had an increasing history. Similarly, if $\alpha < \frac{1}{2}$ then $X(t) - X(0)$ and $X(t+h) - X(t)$ tend to be of opposite sign. Note also that (16.10) implies that the process is self-affine, that is the scaled paths $\gamma^{-\alpha}X(\gamma t)$ have the same statistical distribution as $X(t)$ for $\gamma > 0$.
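As a quick numerical illustration of (16.11), the following sketch evaluates the covariance between the past increment $X(t) - X(0)$ and the future increment $X(t+h) - X(t)$ for a few indices. The function name `increment_covariance` is a hypothetical helper introduced only for this illustration.

```python
def increment_covariance(t: float, h: float, alpha: float) -> float:
    """Right-hand side of (16.11): E[(X(t) - X(0)) (X(t+h) - X(t))] for t, h > 0."""
    return 0.5 * ((t + h) ** (2 * alpha) - t ** (2 * alpha) - h ** (2 * alpha))


# The sign of the covariance changes at alpha = 1/2 (the Brownian case).
for alpha in (0.3, 0.5, 0.7):
    print(alpha, increment_covariance(1.0, 1.0, alpha))
```

For $t = h = 1$ the value is negative at $\alpha = 0.3$, zero at $\alpha = 0.5$ and positive at $\alpha = 0.7$, in line with the discussion above.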

The almost sure dimension of fractional Brownian graphs may be determined 
in a similar way to the strict Brownian case. 

Proposition 16.6 

Suppose $0 < \lambda < \alpha$. With probability 1, an index-$\alpha$ Brownian sample function $X : [0,1] \to \mathbb{R}$ satisfies

\[ |X(t+h) - X(t)| \le b|h|^{\lambda} \tag{16.12} \]

if $|h| < H_0$, for some $H_0 > 0$ and $b > 0$.

Note on proof. Provided that $\lambda < \frac{1}{2}$, the proof goes through as in Proposition 16.1, using (16.10) instead of (16.4). However, if $\alpha > \lambda \ge \frac{1}{2}$ this leads to an estimate $c_2 h^{1/2-\alpha}$ in place of (16.6) and rather more sophisticated techniques from probability theory are required to show that the Hölder condition (16.12) is valid uniformly for all $t$. □

Theorem 16.7 

With probability 1 the graph of an index-$\alpha$ Brownian sample function $X : [0,1] \to \mathbb{R}$ has Hausdorff and box dimensions $2 - \alpha$.

Proof. Corollary 11.2(a) together with the Hölder condition (16.12) show that the dimension is almost surely at most $2 - \alpha$. The lower bound is obtained as in Theorem 16.4 using the probability distribution (16.10). □

The autocorrelation theory discussed in Section 11.2 may be applied to fractional Brownian functions. It is convenient to assume that $X$ is defined for all time, i.e. $X : (-\infty, \infty) \to \mathbb{R}$. This requires only trivial modification to the definition (FBM). Since the variance $E(|X(t+h) - X(t)|^2)$ tends to infinity with $h$, we have

\[ \lim_{T\to\infty} E\Big(\frac{1}{2T}\int_{-T}^{T} X(t)^2\,dt\Big) = \infty \]

so the sample functions tend to have an infinite mean square. Nevertheless,

\[ E\Big(\frac{1}{2T}\int_{-T}^{T} (X(t+h) - X(t))^2\,dt\Big) = \frac{1}{2T}\int_{-T}^{T} E(X(t+h) - X(t))^2\,dt = h^{2\alpha}. \]

It may be deduced that 'on average' the sample functions satisfy

\[ \frac{1}{2T}\int_{-T}^{T} (X(t+h) - X(t))^2\,dt \simeq c\,h^{2\alpha} \]

and, according to (11.18) and (11.19), this does indeed correspond to a graph of dimension $2 - \alpha$. Taking this parallel further, we might expect $X(t)$ to have a power spectrum (11.15) approximately $1/\omega^{1+2\alpha}$.

Figure 16.3 Realizations of index-$\alpha$ fractional Brownian motion for $\alpha$ = 0.1, 0.3, 0.5, 0.7, 0.9

Simulation of fractional Brownian motion is much more awkward than Brownian motion, because the increments are not independent. However, with modern computing power, exact simulation is possible. To get a realization of index-$\alpha$ fractional Brownian motion $X$ at points $t_1, \ldots, t_k$, we recall that the covariances of $X$ are given by the covariance matrix

\[ A_{i,j} = E(X(t_i)X(t_j)) = \tfrac{1}{2}\big(|t_i|^{2\alpha} + |t_j|^{2\alpha} - |t_i - t_j|^{2\alpha}\big). \]

Then, if $M$ is a matrix such that $MM^T = A$, where $T$ denotes transpose, and $V = (V_1, \ldots, V_k)$ is a vector of independent random variables with normal distribution of mean 0 and variance 1, $X \equiv (X(t_i)) = MV$ has the distribution of index-$\alpha$ fractional Brownian motion at the points $t_i$. To see this we check the covariances, using matrix notation:

\[ E(X(t_i)X(t_j)) = E(XX^T) = E(MV(MV)^T) = E(MVV^TM^T) = M\,E(VV^T)\,M^T = M I M^T = A. \]

Thus, by drawing a sequence of independent normally distributed random numbers for the $V_i$, the product $X = MV$ gives a realization of index-$\alpha$ fractional Brownian motion. Finding a matrix $M$ such that $A$ has 'Choleski decomposition' $A = MM^T$ is computationally intensive if $k$ is large, and there are various numerical approaches to facilitate this.
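The simulation recipe just described can be sketched in a few lines, assuming NumPy is available. The function names `fbm_covariance` and `simulate_fbm`, the time grid and the seed are choices made for this illustration only; the sample times are taken strictly positive and distinct so that the covariance matrix is positive definite and the factorization exists.

```python
import numpy as np


def fbm_covariance(times, alpha):
    """Covariance matrix A[i, j] = (|t_i|^2a + |t_j|^2a - |t_i - t_j|^2a) / 2."""
    t = np.asarray(times, dtype=float)
    return 0.5 * (
        np.abs(t[:, None]) ** (2 * alpha)
        + np.abs(t[None, :]) ** (2 * alpha)
        - np.abs(t[:, None] - t[None, :]) ** (2 * alpha)
    )


def simulate_fbm(times, alpha, rng):
    """One realization X = M V at the given times, where M M^T = A."""
    A = fbm_covariance(times, alpha)
    M = np.linalg.cholesky(A)            # lower-triangular factor of A
    V = rng.standard_normal(len(times))  # independent N(0, 1) variables
    return M @ V


# Illustrative use: 100 strictly positive, equally spaced times in (0, 1].
times = np.linspace(0.01, 1.0, 100)
rng = np.random.default_rng(0)
path = simulate_fbm(times, alpha=0.7, rng=rng)
```

The cost of the factorization grows like $k^3$ for $k$ sample points, which is the computational burden referred to above.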

An alternative method of constructing functions with characteristics similar to index-$\alpha$ Brownian functions is to randomize the Weierstrass function (11.4). Consider the random function

\[ X(t) = \sum_{k=1}^{\infty} C_k \lambda^{-\alpha k}\sin(\lambda^k t + A_k) \tag{16.13} \]

where $\lambda > 1$ and where the $C_k$ are independent random variables with the normal distribution of zero mean and variance one, and the 'phases' $A_k$ are independent, each having the uniform distribution on $[0, 2\pi)$. Clearly $E(X(t+h) - X(t)) = 0$. Furthermore, using the formula for the difference of two sines,

\[
\begin{aligned}
E(X(t+h) - X(t))^2 &= E\Big(\sum_{k=1}^{\infty} C_k \lambda^{-\alpha k}\,2\sin(\tfrac{1}{2}\lambda^k h)\cos(\lambda^k(t + \tfrac{1}{2}h) + A_k)\Big)^2\\
&= 2\sum_{k=1}^{\infty} \lambda^{-2\alpha k}\sin^2(\tfrac{1}{2}\lambda^k h)
\end{aligned}
\]

using that $E(C_kC_j) = 1$ or $0$ according as to whether $k = j$ or not, and that the mean of $\cos^2(a + A_k)$ is $\frac{1}{2}$. Choosing $N$ so that $\lambda^{-(N+1)} \le h < \lambda^{-N}$, gives

\[
\begin{aligned}
E(X(t+h) - X(t))^2 &\simeq \tfrac{1}{2}\sum_{k=1}^{N} \lambda^{-2\alpha k}\lambda^{2k}h^2 + 2\sum_{k=N+1}^{\infty} \lambda^{-2\alpha k}\\
&\simeq c\,\lambda^{-2\alpha N}\\
&\simeq c\,h^{2\alpha}
\end{aligned}
\]

in the sense that $0 < c_1 \le E(X(t+h) - X(t))^2/h^{2\alpha} \le c_2 < \infty$ for $h < 1$. Thus (16.13) has certain statistical characteristics resembling index-$\alpha$ fractional Brownian motion, and provides a usable method for drawing random graphs of various dimensions. Such functions are often used in fractal modelling. A value of $\alpha = 0.8$, corresponding to a graph of dimension 1.2, is about right for a 'mountain skyline'.
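A minimal sketch of drawing a graph from (16.13), assuming NumPy. The infinite series must be truncated in practice, so the number of terms `n_terms`, the default $\lambda = 1.5$ and the function name `random_weierstrass` are assumptions of this illustration rather than values taken from the text.

```python
import numpy as np


def random_weierstrass(t, alpha, lam=1.5, n_terms=60, rng=None):
    """Truncated version of (16.13): sum over k = 1..n_terms of
    C_k * lam**(-alpha*k) * sin(lam**k * t + A_k)."""
    rng = np.random.default_rng() if rng is None else rng
    k = np.arange(1, n_terms + 1)
    C = rng.standard_normal(n_terms)            # amplitudes, N(0, 1)
    A = rng.uniform(0.0, 2.0 * np.pi, n_terms)  # phases, uniform on [0, 2*pi)
    t = np.asarray(t, dtype=float)
    # Row i holds the n_terms summands evaluated at t[i].
    terms = C * lam ** (-alpha * k) * np.sin(np.outer(t, lam ** k) + A)
    return terms.sum(axis=1)


# Illustrative 'mountain skyline' with alpha = 0.8.
t = np.linspace(0.0, 1.0, 1000)
skyline = random_weierstrass(t, alpha=0.8, rng=np.random.default_rng(1))
```

Because the terms decay like $\lambda^{-\alpha k}$, the truncation error is of order $\lambda^{-\alpha\,\texttt{n\_terms}}$, so only scales below roughly $\lambda^{-\texttt{n\_terms}}$ are affected.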

As might be expected, the level sets of index-$\alpha$ Brownian sample functions are typically of dimension $1 - \alpha$. Proposition 16.5 generalizes to give that, with probability 1, $\dim_H X^{-1}(c) \le 1 - \alpha$ for almost all $c$, and that, for given $c$, $\dim_H X^{-1}(c) = 1 - \alpha$ with positive probability.

A further generalization of fractional Brownian motion is multifractional Brownian motion, which looks and behaves like index-$\alpha(t)$ fractional Brownian motion close to $t$, where $\alpha : [0,\infty) \to (0,1)$ is a given continuous function.

16.3 Lévy stable processes

An alternative generalization of Brownian motion gives the Lévy stable processes. A Lévy process is a random function $X : [0,\infty) \to \mathbb{R}$ such that the increments $X(t+h) - X(t)$ are stationary, i.e. with distribution depending only on $h$, and independent, i.e. with $X(t_2) - X(t_1), \ldots, X(t_{2m}) - X(t_{2m-1})$ independent if $0 \le t_1 \le t_2 \le \cdots \le t_{2m}$. However, except in very special cases such as Brownian motion, Lévy processes have infinite variance and are discontinuous with probability 1. A Lévy process is stable if a self-affinity condition holds, that is $\gamma^{-1/\alpha}X(\gamma t)$ and $X(t)$ have the same distribution for all $\gamma > 0$, and some $\alpha$.

It is not, in general, possible to specify the probability distribution of stable 
processes directly. Fourier transforms are usually used to define such distribu- 
tions, and analysis of the dimensions of graphs and trails of stable processes 
requires Fourier transform methods. 

The probability distribution of a random variable $Y$ may be specified by its characteristic function, i.e. the Fourier transform $E(\exp(iuY))$ for $u \in \mathbb{R}$. To define a stable process, we take a suitable function $\psi : \mathbb{R} \to \mathbb{R}$ and require that the increments $X(t+h) - X(t)$ satisfy

\[ E\big(\exp(iu(X(t+h) - X(t)))\big) = \exp(-h\psi(u)) \tag{16.14} \]

with $X(t_2) - X(t_1), \ldots, X(t_{2m}) - X(t_{2m-1})$ independent if $0 \le t_1 \le t_2 \le \cdots \le t_{2m}$. Clearly the increments are stationary. This definition is, at least, consistent in the following sense. If $t_1 < t_2 < t_3$, then, taking expectations and using independence,

\[
\begin{aligned}
E\big(\exp(iu(X(t_3) - X(t_1)))\big) &= E\big(\exp(iu((X(t_3) - X(t_2)) + (X(t_2) - X(t_1))))\big)\\
&= E\big(\exp(iu(X(t_3) - X(t_2)))\big)\,E\big(\exp(iu(X(t_2) - X(t_1)))\big)\\
&= \exp(-(t_3 - t_2)\psi(u))\,\exp(-(t_2 - t_1)\psi(u))\\
&= \exp(-(t_3 - t_1)\psi(u)).
\end{aligned}
\]

It may be shown that, for suitable $\psi$, stable processes do exist.

Taking $\psi(u) = c|u|^{\alpha}$ with $0 < \alpha \le 2$ gives the stable symmetric process of index-$\alpha$. Replacing $h$ by $\gamma h$ and $u$ by $\gamma^{-1/\alpha}u$ leaves the right-hand side of (16.14) unaltered, and it follows that $\gamma^{-1/\alpha}X(\gamma t)$ has the same statistical distribution as $X(t)$, so such processes are self-affine. The case $\alpha = 2$ is standard Brownian motion.
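For readers who wish to experiment, the following sketch samples a stable symmetric process on a grid, assuming NumPy. The text does not prescribe a sampling method; the Chambers-Mallows-Stuck formula is swapped in here as one standard way to draw a variable with characteristic function $\exp(-|u|^{\alpha})$. The function names, the starting value $X(0) = 0$ and the default $c = 1$ are assumptions of this illustration.

```python
import numpy as np


def symmetric_stable(alpha, size, rng):
    """Draw samples S with E exp(iuS) = exp(-|u|**alpha), 0 < alpha <= 2,
    using the Chambers-Mallows-Stuck representation."""
    U = rng.uniform(-np.pi / 2, np.pi / 2, size)  # uniform angle
    W = rng.exponential(1.0, size)                # unit-mean exponential
    return (
        np.sin(alpha * U) / np.cos(U) ** (1.0 / alpha)
        * (np.cos((1.0 - alpha) * U) / W) ** ((1.0 - alpha) / alpha)
    )


def stable_path(alpha, n_steps, h, c=1.0, rng=None):
    """Values X(0), X(h), ..., X(n_steps*h) with psi(u) = c|u|**alpha in (16.14).
    Starting at X(0) = 0 is an assumption made for this sketch."""
    rng = np.random.default_rng() if rng is None else rng
    # Independent stationary increments; an increment over time h scales as (c*h)**(1/alpha).
    increments = (c * h) ** (1.0 / alpha) * symmetric_stable(alpha, n_steps, rng)
    return np.concatenate(([0.0], np.cumsum(increments)))


path = stable_path(alpha=1.5, n_steps=1000, h=1e-3, rng=np.random.default_rng(2))
```

With $\alpha = 2$ the sampler reduces to a normal variable of variance 2, matching $\exp(-u^2)$, and with $\alpha = 1$ it reduces to $\tan U$, a Cauchy variable.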

Theorem 16.8 

With probability 1, the graph of the stable symmetric process of index-$\alpha$ has Hausdorff and box dimension $\max\{1, 2 - 1/\alpha\}$.

Partial proof. We show that $\dim_H \operatorname{graph} X \le \overline{\dim}_B \operatorname{graph} X \le \max\{1, 2 - 1/\alpha\}$. Write $R_f[t_1, t_2] = \sup\{|f(t) - f(u)| : t_1 \le t, u \le t_2\}$ for the maximum range of a function $f$ over the interval $[t_1, t_2]$. By virtue of the self-affinity of the process $X$, for any $t$ and $0 < \delta < 1$

\[ E(R_X[t, t+\delta]) = \delta^{1/\alpha}E(R_X[0,1]). \]

If $N_\delta$ squares of the $\delta$-coordinate mesh are intersected by graph $X$, it follows from Proposition 11.1 that

\[
\begin{aligned}
E(N_\delta) &\le 2m + \delta^{-1}\sum_{i=0}^{m-1} E(R_X[i\delta, (i+1)\delta])\\
&= 2m + m\,\delta^{-1}\delta^{1/\alpha}E(R_X[0,1])
\end{aligned}
\]

where $m$ is the least integer greater than or equal to $1/\delta$, so $m \le 2/\delta$. It may be shown, and is at least plausible, that $E(R_X[0,1]) < \infty$, so there is a constant $c$ such that $E(N_\delta\delta^{\beta}) \le c$ for all small $\delta$, where $\beta = \max\{1, 2 - 1/\alpha\}$. Then $E(N_\delta\delta^{\beta+\varepsilon}) \le c\,\delta^{\varepsilon}$ if $\varepsilon > 0$, so that

\[ E\Big(\sum_{k=1}^{\infty} N_{2^{-k}}(2^{-k})^{\beta+\varepsilon}\Big) \le c\sum_{k=1}^{\infty}(2^{-k})^{\varepsilon} < \infty. \]

It follows that, with probability 1, $\sum_{k=1}^{\infty} N_{2^{-k}}(2^{-k})^{\beta+\varepsilon} < \infty$. In particular, $N_\delta\delta^{\beta+\varepsilon} \to 0$ as $\delta = 2^{-k} \to 0$, so $\overline{\dim}_B \operatorname{graph} X \le \beta$ with probability 1 using Proposition 4.1. □

If $\alpha < 1$ then almost surely $\dim_H \operatorname{graph} X = 1$, the smallest dimension possible for the graph of any function on $[0,1]$. This reflects the fact that the sample functions consist of 'jumps', with infinitely many jumps in every positive time interval, though many such jumps are very small. The image of $X$, that is $\{X(t) : 0 \le t \le 1\}$, has dimension $\alpha$ with probability 1, which is indicative of the distribution of the jumps. It may be shown that the probability of there being $k$ jumps of absolute value at least $a$ in the interval $[t, t+h]$ is $(ha^{-\alpha})^k\exp(-ha^{-\alpha})/k!$, corresponding to a Poisson distribution of mean $ha^{-\alpha}$.

If $1 < \alpha < 2$ the stable symmetric process combines a 'continuous' component and a 'jump' component.



16.4 Fractional Brownian surfaces 

We end this chapter with a brief discussion of fractional Brownian surfaces which have been used very effectively in creating computer-generated landscapes.

We replace the time variable $t$ by coordinates $(x, y)$ so the random variable $X(x, y)$ may be thought of as the height of a surface at the point $(x, y)$.

For $0 < \alpha < 1$ we define an index-$\alpha$ fractional Brownian function $X : \mathbb{R}^2 \to \mathbb{R}$ to be a Gaussian random function such that:

(FBS) (i) with probability 1, $X(0, 0) = 0$, and $X(x, y)$ is a continuous function of $(x, y)$;

(ii) for $(x, y), (h, k) \in \mathbb{R}^2$ the height increments $X(x+h, y+k) - X(x, y)$ have the normal distribution with mean zero and variance $(h^2 + k^2)^{\alpha} = |(h, k)|^{2\alpha}$, thus

\[ P(X(x+h, y+k) - X(x, y) \le z) = (2\pi)^{-1/2}(h^2 + k^2)^{-\alpha/2}\int_{-\infty}^{z}\exp\Big(-\frac{u^2}{2(h^2 + k^2)^{\alpha}}\Big)\,du. \tag{16.15} \]

Some effort is required to demonstrate the existence of a process satisfying these conditions. The correlations between the random variables $X(x, y)$ at different points are quite involved.

We term $\{(x, y, X(x, y)) : (x, y) \in \mathbb{R}^2\}$ an index-$\alpha$ fractional Brownian surface. Some sample surfaces are depicted in figure 16.4.

Comparing (16.15) with the distribution (16.10) we see that the graph obtained by intersecting $X(x, y)$ with any vertical plane is that of a 1-dimensional index-$\alpha$ Brownian function (after adding a constant to ensure $X(0) = 0$). We can often gain information about surfaces by considering such vertical sections.

Theorem 16.9 

With probability 1, an index-$\alpha$ Brownian sample surface has Hausdorff and box dimensions equal to $3 - \alpha$.

Proof. Using similar methods to Propositions 16.1 and 16.6, it may be shown that if $\lambda < \alpha$ then, with probability 1, an index-$\alpha$ Brownian function $X : [0,1] \times [0,1] \to \mathbb{R}$ satisfies

\[ |X(x+h, y+k) - X(x, y)| \le b(h^2 + k^2)^{\lambda/2} = b|(h, k)|^{\lambda} \]

provided that $(h, k)$ is sufficiently small. The analogue of Corollary 11.2(a) for a function of two variables (see Exercise 11.9) then gives $3 - \lambda$ as an upper bound for the upper box dimension of the surface.

If we fix $x_0$, then $X(x_0, y) - X(x_0, 0)$ is an index-$\alpha$ Brownian function on $[0,1]$, so by Theorem 16.7 $X(x_0, y)$ has graph of Hausdorff dimension $2 - \alpha$ with probability 1. Thus, with probability 1, the graph of $X(x_0, y)$ has dimension $2 - \alpha$ for almost all $0 \le x_0 \le 1$. But these graphs are just parallel sections of the surface given by $X$, so, by the obvious analogue of Corollary 7.10 in $\mathbb{R}^3$, the surface has Hausdorff dimension at least $(2 - \alpha) + 1$. □

Figure 16.4 Index-$\alpha$ fractional Brownian surfaces: (a) $\alpha$ = 0.5, dimension = 2.5; (b) $\alpha$ = 0.8, dimension = 2.2

The level sets $X^{-1}(c) = \{(x, y) : X(x, y) = c\}$ are the contours of the random surface. Proposition 16.5 extends to index-$\alpha$ surfaces. It may be shown that, with probability 1, $\dim_H X^{-1}(c) \le 2 - \alpha$ for almost all $c$ (in the sense of 1-dimensional measure), and that $\dim_H X^{-1}(c) = 2 - \alpha$ with positive probability. Thus the contours of index-$\alpha$ surfaces have, in general, dimension $2 - \alpha$.

The problems of generating sample surfaces for index-$\alpha$ Brownian functions are considerable, and we do not go into details here. However, we remark that an analogue of (16.13) for index-$\alpha$ surfaces is

\[ X(x, y) = \sum_{k=1}^{\infty} C_k\lambda^{-\alpha k}\sin\big(\lambda^k(x\cos B_k + y\sin B_k) + A_k\big) \]

where the $C_k$ are independent with normal distribution of mean zero and variance 1, and the $A_k$ and $B_k$ are independent with uniform distribution on $[0, 2\pi)$. Such functions provide one possible approach to computer generation of random surfaces.
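The surface analogue of (16.13) can be evaluated on a grid in the same way as the one-variable series. The sketch below assumes NumPy; the truncation level `n_terms`, the default $\lambda = 1.5$ and the name `random_surface` are assumptions made for this illustration.

```python
import numpy as np


def random_surface(x, y, alpha, lam=1.5, n_terms=40, rng=None):
    """Truncated random trigonometric surface on the grid x by y.
    Returns an array Z with Z[i, j] the height at (x[i], y[j])."""
    rng = np.random.default_rng() if rng is None else rng
    X, Y = np.meshgrid(np.asarray(x, dtype=float), np.asarray(y, dtype=float), indexing="ij")
    Z = np.zeros_like(X)
    for k in range(1, n_terms + 1):
        C = rng.standard_normal()               # amplitude C_k ~ N(0, 1)
        A, B = rng.uniform(0.0, 2.0 * np.pi, 2)  # phase A_k and direction B_k
        Z += C * lam ** (-alpha * k) * np.sin(lam ** k * (X * np.cos(B) + Y * np.sin(B)) + A)
    return Z


grid = np.linspace(0.0, 1.0, 64)
heights = random_surface(grid, grid, alpha=0.8, rng=np.random.default_rng(4))
```

Each term is a plane wave of frequency $\lambda^k$ travelling in the random direction $B_k$, so no direction is favoured on average.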

The ideas in this chapter may be extended in many directions and combined in many ways. Fractional Brownian motion and stable processes may be defined from $\mathbb{R}^n$ to $\mathbb{R}^m$ for any $n, m$ and there are many other variations. Questions of level sets, multiple points, intersections with fixed sets, the images $X(F)$ for various fractals $F$, etc. arise in all these situations. Analysis of such problems often requires sophisticated probabilistic techniques alongside a variety of geometrical arguments.

16.5 Notes and references 

The probabilistic basis of Brownian motion is described in the books by Karlin 
and Taylor (1975, 1981), Billingsley (1995) and Rogers and Williams (2000). 

Fractional Brownian motion was introduced by Mandelbrot and Van Ness (1968); see the papers in Mandelbrot (2002). The books by Adler (1981), Kahane (1985), Samorodnitsky and Taqqu (1994) and Embrechts and Maejima (2002) give detailed treatments of the mathematics of fractional Brownian functions and surfaces, with the first two books including dimensional aspects and the latter two emphasizing self-similarity. Multifractional Brownian motion is presented in Ayache and Lévy Véhel (1999) and Ayache, Cohen and Lévy Véhel (2000).

Stable processes were introduced by Lévy (1948); see Samorodnitsky and Taqqu (1994), Rogers and Williams (2000) and Mandelbrot (2002). The surveys by Taylor (1973, 1986) mention many dimensional properties of these processes.

Computational methods for generating Brownian paths and surfaces are discussed, for example, in Voss (1985), Peitgen and Saupe (1988), Falconer and Lévy Véhel (2000) and Embrechts and Maejima (2002).



Exercises 

16.1 Use the statistical self-similarity of Brownian motion to show that, with probability 1, a Brownian trail in $\mathbb{R}^3$ has box dimension of at most 2.

16.2 Let $X : [0,\infty) \to \mathbb{R}^3$ be usual Brownian motion. Show that, with probability 1, the image $X(F)$ of the middle third Cantor set $F$ has Hausdorff dimension at most $\log 4/\log 3$. (Harder: show that it is almost surely equal to $\log 4/\log 3$.) What is the analogous result for index-$\alpha$ Brownian motion?

16.3 Let $X : [0,\infty) \to \mathbb{R}^3$ be usual Brownian motion, and let $F$ be a compact subset of $\mathbb{R}^3$. Use Theorem 8.2 to show that if $\dim_H F > 1$ then there is a positive probability of the Brownian trail $X(t)$ hitting $F$.

16.4 Let $X(t)$ be Brownian motion, and $c$ a constant. Show that the graph of the process $X(t) + ct$ has dimension $1\frac{1}{2}$ with probability 1. (This process is called Brownian motion with drift.)

16.5 Show that, with probability 1, the Brownian sample function $X : [0,\infty) \to \mathbb{R}$ is not monotonic on any interval $[t, u]$.

16.6 Let $X(t)$ be Brownian motion. Show that with probability 1, $X(t) = 0$ for some $t > 0$. Use self-affinity to show that with probability 1, for every $a > 0$, $X(t) = 0$ for some $t$ with $0 < t < a$, and thus $X(t) = 0$ for infinitely many $t$ with $0 < t < a$.

16.7 Let $X(t)$ be Brownian motion. Show that for $q > 0$, $E(|X(t+h) - X(t)|^q) = c|h|^{q/2}$ where $c$ depends only on $q$.

16.8 Show that, if $\lambda > \alpha$, then with probability 1 for almost all $t$ the Hölder inequality (16.12) fails.

16.9 Take $\frac{1}{2} \le \alpha_1 \le \alpha_2 < 1$ and let $X_1(t)$ and $X_2(t)$ be independent Brownian functions from $[0,1]$ to $\mathbb{R}$ of indices $\alpha_1$ and $\alpha_2$ respectively. Show that, with probability 1, the path in $\mathbb{R}^2$ given by $\{(X_1(t), X_2(t)) : 0 \le t \le 1\}$ has Hausdorff and box dimensions of $(1 + \alpha_2 - \alpha_1)/\alpha_2$.

16.10 Verify that for index-$\alpha$ fractional Brownian motion $E((X(t) - X(0))(X(t+h) - X(t))) = \frac{1}{2}[(t+h)^{2\alpha} - t^{2\alpha} - h^{2\alpha}]$. Show that this is positive if $\frac{1}{2} < \alpha < 1$ and negative if $0 < \alpha < \frac{1}{2}$. What does this tell us about the sample functions?



Chapter 17 Multifractal measures 



A mass distribution $\mu$ may be spread over a region in such a way that the concentration of mass is highly irregular. Indeed, the set of points where the local mass concentration obeys an index $\alpha$ power law, say $\mu(B(x, r)) \simeq r^{\alpha}$ for small $r$, may determine a different fractal for different values of $\alpha$. Thus, a whole range of fractals may arise from a single measure, and we may examine the structure of these fractals and their inter-relationship. A measure $\mu$ with such a rich structure is called a multifractal measure or just a multifractal. As with fractals, a precise definition of 'multifractal' tends to be avoided.

Multifractal measures have been observed in many physical situations, for example in fluid turbulence, rainfall distribution, mass distribution across the universe, viscous fingering, neural networks, share prices, and in many other phenomena.

An important class of multifractal occurs in dynamical systems, see Section 13.7. If $f : D \to D$ is a mapping on a domain $D$, we can define a 'residence measure' by

\[ \mu(A) = \lim_{m\to\infty}\frac{1}{m}\#\{k : 1 \le k \le m \text{ and } f^k(x) \in A\} \]

for subsets $A$ of $D$, where $x \in D$ is some initial point. Thus, $\mu(A)$ represents the proportion of time that the iterates of $x$ spend in $A$. We have seen that the support of $\mu$ is often an attractor of $f$ and may be a fractal. However, the non-uniformity of $\mu$ across the attractor may highlight further dynamical structure which can be recorded and analysed using multifractal theory.

Many ideas related to fractals have counterparts in multifractals, for example 
projection of multifractal measures onto a line or plane may be considered in 
an analogous manner to projection of fractals, though the calculations can be 
considerably more awkward. 

There are two basic approaches to multifractal analysis: fine theory, where we examine the structure and dimensions of the fractals that arise themselves, and coarse theory, where we consider the irregularities of distribution of the measure of balls of small but positive radius $r$ and then take a limit as $r \to 0$. In many ways, fine multifractal analysis parallels finding the Hausdorff dimension of sets, whilst coarse theory is related to box-counting dimension. The coarse approach is usually more practicable when it comes to analysing physical examples or computer experiments, but the fine theory may be more convenient for mathematical analysis. There are many parallels between the fine and the coarse approaches, and for many measures both approaches lead to the same 'multifractal spectra'.

We outline coarse and fine multifractal theory, and then consider the special 
case of self-similar measures in more detail, both to illustrate the theory and to 
demonstrate central ideas such as Legendre transformation. 

Throughout our discussion, we refer to the following example of a self-similar 
multifractal measure on a Cantor set. Just as the middle third Cantor set illustrates 
many features of fractal mathematics, this self-similar measure supported by the 
Cantor set is typical of a large class of multifractal measures. 



Example 17.1 

Let $p_1, p_2 > 0$ be given, with $p_1 + p_2 = 1$. We construct a measure $\mu$ on the middle third Cantor set $F = \bigcap_{k=0}^{\infty} E_k$ by repeated subdivision. (Here $E_k$ comprises the $2^k$ $k$th level intervals of length $3^{-k}$ in the usual Cantor set construction, see figure 0.1.) Assign the left interval of $E_1$ mass $p_1$ and the right interval mass $p_2$. Divide the mass on each interval of $E_1$ between its two subintervals of $E_2$ in the ratio $p_1 : p_2$. Continue in this way, so that the mass on each interval of $E_k$ is divided in ratio $p_1 : p_2$ between its two subintervals of $E_{k+1}$. This defines a mass distribution on $F$, see Proposition 1.7.

It is easily seen that if $I$ is a $k$th level interval of $E_k$ then $\mu(I) = p_1^r p_2^{k-r}$ where, in constructing $I$, a left-hand interval is taken $r$ times and a right-hand interval $k - r$ times. If $p_1 \ne p_2$ and $k$ is large, the masses of the $k$th level intervals will vary considerably, see figure 17.1, and this is a manifestation of multifractality.
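The repeated subdivision of Example 17.1 is easy to carry out explicitly. The following sketch lists the masses of the $2^k$ level-$k$ intervals from left to right; the function name `level_masses` is a hypothetical helper used only for this illustration.

```python
def level_masses(p1: float, k: int) -> list:
    """Masses of the 2**k level-k intervals of the Cantor construction,
    ordered left to right, for the measure with weights p1 and p2 = 1 - p1."""
    p2 = 1.0 - p1
    masses = [1.0]  # level 0: the whole unit interval carries mass 1
    for _ in range(k):
        # Each interval passes mass to its left (p1) and right (p2) subintervals.
        masses = [m * p for m in masses for p in (p1, p2)]
    return masses


masses = level_masses(0.25, 5)
print(min(masses), max(masses))  # the extremes are p1**k and p2**k
```

Even at level 5 the largest and smallest masses differ by a factor of $(p_2/p_1)^5 = 243$ when $p_1 = 0.25$, which illustrates how quickly the non-uniformity develops.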



17.1 Coarse multifractal analysis 

Coarse multifractal analysis of a measure $\mu$ on $\mathbb{R}^n$ with $0 < \mu(\mathbb{R}^n) < \infty$ is akin to box-counting dimension, in that we count the number of mesh cubes $C$ for which $\mu(C)$ is roughly $r^{\alpha}$. (Recall that the $r$-mesh cubes in $\mathbb{R}^n$ are the cubes of the form $[m_1r, (m_1+1)r] \times \cdots \times [m_nr, (m_n+1)r]$ where $m_1, \ldots, m_n$ are integers.) For $\mu$ a finite measure on $\mathbb{R}^n$ and $\alpha \ge 0$ we write

\[ N_r(\alpha) = \#\{r\text{-mesh cubes } C \text{ with } \mu(C) \ge r^{\alpha}\}, \tag{17.1} \]

and define the coarse multifractal spectrum or coarse singularity spectrum of $\mu$ as

\[ f_C(\alpha) = \lim_{\varepsilon\to0}\lim_{r\to0}\frac{\log^+(N_r(\alpha+\varepsilon) - N_r(\alpha-\varepsilon))}{-\log r} \tag{17.2} \]

if the double limit exists. (We write $\log^+ x = \max\{0, \log x\}$, which ensures $f_C(\alpha) \ge 0$.) Definition (17.2) implies that if $\eta > 0$, and $\varepsilon > 0$ is small enough, then

\[ r^{-f_C(\alpha)+\eta} \le N_r(\alpha+\varepsilon) - N_r(\alpha-\varepsilon) \le r^{-f_C(\alpha)-\eta} \tag{17.3} \]

for all sufficiently small $r$. Roughly speaking $-f_C(\alpha)$ is the power law exponent for the number of $r$-mesh cubes $C$ such that $\mu(C) \simeq r^{\alpha}$. Note that $f_C(\alpha)$ is not the box dimension of the set of $x$ such that $\mu(C(x, r)) \simeq r^{\alpha}$ as $r \to 0$ where $C(x, r)$ is the $r$-mesh cube containing $x$; the coarse spectrum provides a global overview of the fluctuations of $\mu$ at scale $r$, but gives no information about the limiting behaviour of $\mu$ at any point.

Figure 17.1 Construction of the self-similar measure described in Example 17.1. The mass on each interval of $E_k$ in the construction of the Cantor set, indicated by the area of the rectangle, is divided in the ratio $p_1 : p_2$ (in this case $\frac{1}{3} : \frac{2}{3}$) between the two subintervals of $E_{k+1}$. Continuing this process yields a self-similar measure on the Cantor set

To allow for the possibility of the limit (17.2) not existing, we define the lower and the upper coarse multifractal spectra of $\mu$ by

\[ \underline{f}_C(\alpha) = \lim_{\varepsilon\to0}\liminf_{r\to0}\frac{\log^+(N_r(\alpha+\varepsilon) - N_r(\alpha-\varepsilon))}{-\log r} \tag{17.4} \]

and

\[ \overline{f}_C(\alpha) = \lim_{\varepsilon\to0}\limsup_{r\to0}\frac{\log^+(N_r(\alpha+\varepsilon) - N_r(\alpha-\varepsilon))}{-\log r} \tag{17.5} \]

for $\alpha \ge 0$.

It is usually awkward to calculate the coarse multifractal spectrum $f_C$ directly.

Continuation of Example 17.1. In this example, the set $E_k$ is made up of $2^k$ intervals of length $3^{-k}$, and for each $r$, a number $\binom{k}{r}$ of these have mass $p_1^r p_2^{k-r}$, where $\binom{k}{r}$ is the usual binomial coefficient. Thus, assuming without loss of generality that $0 < p_1 < \frac{1}{2}$, we have

\[ N_{3^{-k}}(\alpha) = \sum_{r=0}^{m}\binom{k}{r}, \]

where $m$ is the largest integer such that

\[ p_1^m p_2^{k-m} \ge 3^{-k\alpha}, \quad\text{i.e.}\quad m \le \frac{k(\log p_2 + \alpha\log 3)}{\log p_2 - \log p_1}. \]

It is now possible, but tedious, to estimate $N_{3^{-k}}(\alpha)$ and thus examine its power law exponent. However, we shortly encounter a better approach. □

Next, we introduce moment sums: for $q \in \mathbb{R}$ and $r > 0$ the $q$th power moment sum of $\mu$ is given by

\[ M_r(q) = \sum_{\mathcal{M}_r}\mu(C)^q, \tag{17.6} \]

where the sum is over the set $\mathcal{M}_r$ of $r$-mesh cubes $C$ for which $\mu(C) > 0$. (There is a problem of stability for negative $q$: if a cube $C$ just clips the edge of $\operatorname{spt}\mu$, then $\mu(C)^q$ can be very large. There are ways around this difficulty, for example by restricting the sums to cubes with a central portion intersecting $\operatorname{spt}\mu$, but we do not pursue this here.) We identify the power law behaviour of $M_r(q)$ by defining

\[ \beta(q) = \lim_{r\to0}\frac{\log M_r(q)}{-\log r}, \tag{17.7} \]

assuming this limit exists.

It is hardly surprising that these moment sums are related to the $N_r(\alpha)$. Using (17.1), it follows that if $q \ge 0$ and $\alpha \ge 0$

\[ M_r(q) = \sum_{\mathcal{M}_r}\mu(C)^q \ge r^{q\alpha}N_r(\alpha), \tag{17.8} \]

and if $q < 0$

\[ M_r(q) = \sum_{\mathcal{M}_r}\mu(C)^q \ge r^{q\alpha}\,\#\{r\text{-mesh cubes with } 0 < \mu(C) \le r^{\alpha}\}. \tag{17.9} \]

These inequalities lead to a useful relationship between $f_C(\alpha)$ and $\beta(q)$ in terms of Legendre transformation. The Legendre transform $f_L$ of $\beta$ is defined for $\alpha \ge 0$ by

\[ f_L(\alpha) = \inf_{-\infty<q<\infty}\{\beta(q) + \alpha q\}, \tag{17.10} \]

provided this is finite. There is a useful geometrical interpretation of the Legendre transform. For those $\alpha$ for which the graph of $\beta$ has under it a line of support $L_\alpha$ of slope $-\alpha$, the Legendre transform of $\beta$ is given by the value of the intercept of $L_\alpha$ with the vertical axis, see figure 17.2.

It is easy to see that the coarse spectrum is bounded above by the Legendre transform of $\beta$.

Proposition 17.2 

Let $\mu$ be a finite measure on $\mathbb{R}^n$, and assume that the limit in (17.7) exists. Then for all $\alpha \ge 0$,

\[ \underline{f}_C(\alpha) \le \overline{f}_C(\alpha) \le f_L(\alpha) \tag{17.11} \]

where $f_L$ is the Legendre transform of $\beta$.

Figure 17.2 The Legendre transform of $\beta(q)$ is $f_L(\alpha)$, the intersection of the tangent $L_\alpha$ of slope $-\alpha$ with the $\beta$-axis

Proof. First take $q \ge 0$. Then given $\varepsilon > 0$, (17.8) and (17.5) imply that

\[ M_r(q) \ge r^{q(\alpha+\varepsilon)}N_r(\alpha+\varepsilon) \ge r^{q(\alpha+\varepsilon)}\,r^{-\overline{f}_C(\alpha)+\varepsilon} \tag{17.12} \]

for arbitrarily small $r$. It follows from (17.7) that

\[ -\beta(q) \le q(\alpha+\varepsilon) - \overline{f}_C(\alpha) + \varepsilon, \]

so $\overline{f}_C(\alpha) \le \beta(q) + \alpha q$ by taking $\varepsilon$ arbitrarily small. This inequality also holds when $q < 0$ by a parallel argument, using (17.9) with $\alpha$ replaced by $\alpha - \varepsilon$. □

The Legendre transform $f_L$ of $\beta$ given by (17.10) is termed the Legendre spectrum of $\mu$. There are many measures for which the Legendre spectrum equals the coarse multifractal spectrum, i.e. for which equality occurs in (17.11).

Continuation of Example 17.1. Recalling that in this example $\binom{k}{r}$ of the $k$th level intervals of length $3^{-k}$ have mass $p_1^r p_2^{k-r}$, we get

\[ M_{3^{-k}}(q) = \sum_{r=0}^{k}\binom{k}{r}p_1^{qr}p_2^{q(k-r)} = (p_1^q + p_2^q)^k. \]

Hence

\[ \beta(q) = \lim_{r\to0}\frac{\log M_r(q)}{-\log r} = \lim_{k\to\infty}\frac{\log M_{3^{-k}}(q)}{-\log 3^{-k}} = \frac{\log(p_1^q + p_2^q)}{\log 3}. \tag{17.13} \]

(It is easy to see that it is enough to let $r$ tend to 0 through the values $3^{-k}$, compare (3.14).)
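The binomial identity behind (17.13) can be checked numerically. In the sketch below, `moment_sum` adds up $\mu(I)^q$ over the level-$k$ intervals grouped by the number $r$ of left-hand choices, and `beta_estimate` forms $\log M_{3^{-k}}(q)/(-\log 3^{-k})$; both names, and `beta_exact`, are hypothetical helpers introduced for this illustration.

```python
import math


def moment_sum(p1: float, k: int, q: float) -> float:
    """M_r(q) of (17.6) at scale r = 3**(-k) for the measure of Example 17.1."""
    p2 = 1.0 - p1
    # comb(k, r) intervals each carry mass p1**r * p2**(k - r).
    return sum(math.comb(k, r) * (p1 ** r * p2 ** (k - r)) ** q for r in range(k + 1))


def beta_estimate(p1: float, k: int, q: float) -> float:
    """log M_r(q) / (-log r) evaluated at r = 3**(-k)."""
    return math.log(moment_sum(p1, k, q)) / (k * math.log(3))


def beta_exact(p1: float, q: float) -> float:
    """Closed form (17.13)."""
    return math.log(p1 ** q + (1.0 - p1) ** q) / math.log(3)


for q in (-2.0, 0.0, 1.0, 3.0):
    print(q, beta_estimate(1 / 3, 12, q), beta_exact(1 / 3, q))
```

For this measure the ratio is the same at every level $k$, so the estimate agrees with the closed form up to rounding; for a general measure the ratio would only converge as $k \to \infty$.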

The Legendre spectrum of /i comes from taking the Legendre transform of /3, 
that is by minimizing /3(q) + aq over q e IR, Elementary calculus gives that the 
minimum occurs when q satisfies 

= pU°SPi + PJ l °SP2 
(pf+p q 2 )\og3 

The Legendre spectrum is the minimum value, which is given in terms of the 
parameter q by 

, , , Iog(/>i + P\) g(p[logPi + p|logP2) ,..,, 

h(a)= : —q- q— (17.14) 

log 3 (pI+pD log 3 

The graph of $f_L(\alpha)$ when $p_1 = \frac{1}{3}$, $p_2 = \frac{2}{3}$ is displayed in figure 17.3. As we
shall see in Section 17.3, $f_L(\alpha) = f_C(\alpha)$, i.e. the Legendre spectrum coincides
with the coarse multifractal spectrum. □
Figure 17.3 The multifractal spectrum of Example 17.1 with $p_1 = \frac{1}{3}$, $p_2 = \frac{2}{3}$

In practical situations, multifractal spectra are often awkward to estimate and
work with. One might hope to compute the coarse spectrum $f_C$ by 'box-counting'.
For instance, if $\mu$ is a residence measure on the attractor of a dynamical system in
the plane, a count of the proportion of the iterates of an initial point that lie in each
$r$-mesh square $C$ might be used to estimate the number of squares for which
$\alpha_j \leq \log\mu(C)/\log r < \alpha_{j+1}$, where $0 \leq \alpha_1 < \cdots < \alpha_k$. Examining this 'histogram'
for various $r$ enables the power law behaviour of $N_r(\alpha + \varepsilon) - N_r(\alpha - \varepsilon)$ to be
studied and $f_C(\alpha)$ to be estimated. However, this histogram method tends to be
computationally slow and awkward.

In general, it is more satisfactory to use the method of moments for experimental
determination of a multifractal spectrum. Thus, for a range of $q$, the moment
sums (17.6) are estimated for various small $r$ and the power law behaviour
examined to estimate $\beta(q)$ using (17.7). Legendre transformation of $\beta$ gives a
Legendre spectrum of $\mu$, and this is often taken to be the coarse spectrum. The
method of moments is usually more numerically manageable than the histogram
method.
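The method of moments can be sketched numerically for the measure of Example 17.1, where the moment sums are known in closed form at the scales $r = 3^{-k}$. The following is only an illustrative sketch: the function names, the range of levels and the grid of $q$ values are assumptions made for this example, and real data would supply estimated box masses in place of the binomial formula.

```python
import math
from math import comb

def moment_sum(q, k, p1, p2):
    """Moment sum M_r(q) at scale r = 3**-k for the measure of Example 17.1.

    comb(k, j) of the level-k intervals carry mass p1**j * p2**(k - j).
    """
    return sum(comb(k, j) * (p1 ** j * p2 ** (k - j)) ** q for j in range(k + 1))

def beta_estimate(q, p1, p2, levels=range(4, 13)):
    """Least-squares slope of log M_r(q) against -log r, in the spirit of (17.7)."""
    xs = [k * math.log(3) for k in levels]                      # -log r for r = 3**-k
    ys = [math.log(moment_sum(q, k, p1, p2)) for k in levels]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den

def legendre_spectrum(alpha, beta, qs):
    """Approximate inf over q of beta(q) + alpha*q using a finite grid qs."""
    return min(beta(q) + alpha * q for q in qs)
```

For this measure the fitted slope should agree with (17.13), since $\log M_{3^{-k}}(q)$ is exactly linear in $k$; with experimental data the fit would only be approximate and the choice of scales would matter.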



17.2 Fine multifractal analysis 

Fine multifractal analysis looks directly at the fractals determined by the local
intensity of a measure. Let $\mu$ be a measure on $\mathbb{R}^n$ with $0 < \mu(\mathbb{R}^n) < \infty$. We
define the local dimension or Hölder exponent of $\mu$ at $x$ by

$$\dim_{\mathrm{loc}}\mu(x) = \lim_{r \to 0} \frac{\log\mu(B(x,r))}{\log r} \qquad (17.15)$$

provided this limit exists. (More generally, one can work with upper and lower
local limits leading to upper and lower local dimensions, but we do not pursue
this here.) We study the sets of points $x \in \mathbb{R}^n$ where the local dimension takes
particular values. For $\alpha \geq 0$ we define

$$F_\alpha = \{x \in \mathbb{R}^n : \dim_{\mathrm{loc}}\mu(x) = \alpha\} = \left\{x \in \mathbb{R}^n : \lim_{r \to 0} \frac{\log\mu(B(x,r))}{\log r} = \alpha\right\}. \qquad (17.16)$$

Thus, $F_\alpha$ comprises those points where the local dimension exists and equals $\alpha$.
In fine multifractal analysis we aim to find the dimensions of $F_\alpha$ for a range
of $\alpha$. In most examples of interest $F_\alpha$ is dense in the support of $\mu$ so
$\underline{\dim}_B F_\alpha = \underline{\dim}_B \overline{F_\alpha} = \underline{\dim}_B(\operatorname{spt}\mu)$ (and similarly for upper box dimensions) by Proposition
3.4, so box-counting dimensions are of little use in discriminating between the $F_\alpha$.
Thus, we concentrate on the fine (Hausdorff) multifractal spectrum or singularity
spectrum of $\mu$ defined by

$$f_H(\alpha) = \dim_H F_\alpha. \qquad (17.17)$$

(We do not consider here the fine packing spectrum which has a similar definition
using packing dimension.)

Clearly, $0 \leq f_H(\alpha) \leq \dim_H(\operatorname{spt}\mu)$ for all $\alpha \geq 0$, and it follows from Proposition
4.9(b) that

$$0 \leq f_H(\alpha) \leq \alpha. \qquad (17.18)$$

Continuation of Example 17.1. We may code the points $x$ of the Cantor set $F$ as
$(i_1, i_2, \ldots)$ in the usual way, with $i_k = 1$ if $x$ is in the left sub-interval at the $k$th
stage of the Cantor set construction, and $i_k = 2$ if $x$ is in the right sub-interval.
Then, writing $n_j(x|k)$, $j = 1, 2$, for the number of occurrences of $j$ in the first $k$
terms of the sequence coding $x$, we have $\mu(B(x, 3^{-k})) = p_1^{n_1(x|k)} p_2^{n_2(x|k)}$, so that

$$\frac{\log\mu(B(x, 3^{-k}))}{\log 3^{-k}} = \frac{-1}{\log 3}\left(\frac{n_1(x|k)}{k}\log p_1 + \frac{n_2(x|k)}{k}\log p_2\right).$$

Thus, if $n_j(x|k)/k \to a_j$, $j = 1, 2$, then $x \in F_\alpha$ where
$\alpha = -(a_1 \log p_1 + a_2 \log p_2)/\log 3$. In this instance, we could now calculate $f_H(\alpha) = \dim_H F_\alpha$ as
in Proposition 10.1, and this gives a parametric expression equivalent to (17.14).
We defer a formal calculation until the next section, where it will be done in a
more general context. □
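As a numerical illustration of the remark that the frequency calculation leads to an expression equivalent to (17.14), the sketch below compares the two. It rests on two assumptions that are not spelled out in the text: that the set of points whose codes have limiting frequencies $a_1, a_2$ has dimension $-(a_1 \log a_1 + a_2 \log a_2)/\log 3$ (the entropy-type formula one would expect from an argument of the kind in Proposition 10.1), and that the frequency matching parameter $q$ is $a_1 = p_1^q/(p_1^q + p_2^q)$. The function names are illustrative only.

```python
import math

def local_dimension(a1, p1, p2):
    """alpha for points whose codes contain symbol 1 with limiting frequency a1."""
    a2 = 1.0 - a1
    return -(a1 * math.log(p1) + a2 * math.log(p2)) / math.log(3)

def frequency_set_dimension(a1):
    """Assumed entropy-type dimension -(a1 log a1 + a2 log a2) / log 3."""
    a2 = 1.0 - a1
    return -sum(a * math.log(a) for a in (a1, a2) if a > 0) / math.log(3)

def spectrum_17_14(q, p1, p2):
    """Return (alpha, f_L(alpha)) from the parametric formulas leading to (17.14)."""
    s = p1 ** q + p2 ** q
    t = p1 ** q * math.log(p1) + p2 ** q * math.log(p2)
    alpha = -t / (s * math.log(3))
    return alpha, math.log(s) / math.log(3) + q * alpha
```

Under these assumptions, evaluating `local_dimension` and `frequency_set_dimension` at $a_1 = p_1^q/(p_1^q + p_2^q)$ reproduces the pair returned by `spectrum_17_14` for the same $q$.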

Just as the Hausdorff dimension of a set is never more than its box dimension, 
there is a basic inequality between the fine and coarse spectra. 
Lemma 17.3

Let $\mu$ be a finite measure on $\mathbb{R}^n$. Then

$$f_H(\alpha) \leq \underline{f}_C(\alpha) \leq \overline{f}_C(\alpha) \qquad (17.19)$$

for all $\alpha \geq 0$.

Proof. The right-hand inequality of (17.19) is obvious. We prove the left-hand
inequality in the case where $\mu$ is a measure on $\mathbb{R}$; the proof is similar for $n \geq 2$
except that measures of balls and cubes have to be compared instead of measures
of intervals.

For fixed $\alpha > 0$ write for brevity $f = f_H(\alpha) = \dim_H F_\alpha$; we may assume
$f > 0$. Given $0 < \varepsilon < f$ then $\mathcal{H}^{f-\varepsilon}(F_\alpha) = \infty$. By (17.16) there is a set
$F_\alpha^0 \subset F_\alpha$ with $\mathcal{H}^{f-\varepsilon}(F_\alpha^0) > 1$ and a number $c_0 > 0$ such that

$$3r^{\alpha+\varepsilon} \leq \mu(B(x,r)) < 2^{\varepsilon-\alpha} r^{\alpha-\varepsilon} \qquad (17.20)$$

for all $x \in F_\alpha^0$ and all $0 < r \leq c_0$, using Egoroff's theorem. We may choose $\delta$
with $0 < \delta \leq \frac{1}{2}c_0$ such that $\mathcal{H}_\delta^{f-\varepsilon}(F_\alpha^0) \geq 1$.

For each $r \leq \delta$ we consider $r$-mesh intervals (of the form $[mr, (m+1)r]$ with
$m \in \mathbb{Z}$) that intersect $F_\alpha^0$. Such an interval $I$ contains a point $x$ of $F_\alpha^0$, with

$$B(x,r) \subset I \cup I_L \cup I_R \subset B(x,2r),$$

where $I_L$ and $I_R$ are the $r$-mesh intervals immediately to the left and right of $I$.
By (17.20)

$$3r^{\alpha+\varepsilon} \leq \mu(B(x,r)) \leq \mu(I \cup I_L \cup I_R) \leq \mu(B(x,2r)) < r^{\alpha-\varepsilon},$$

so that

$$r^{\alpha+\varepsilon} \leq \mu(I_0) < r^{\alpha-\varepsilon}, \qquad (17.21)$$

where $I_0$ is one of $I$, $I_L$ and $I_R$. By definition of $\mathcal{H}_\delta^{f-\varepsilon}$ there are at least
$r^{\varepsilon-f}\mathcal{H}_\delta^{f-\varepsilon}(F_\alpha^0) \geq r^{\varepsilon-f}$ distinct $r$-mesh intervals that intersect $F_\alpha^0$, so there are at
least $\frac{1}{3}r^{\varepsilon-f}$ distinct $r$-mesh intervals $I_0$ that satisfy (17.21) (note that two intervals
$I$ separated by $2r$ or more give rise to different intervals $I_0$). We conclude
that for $r \leq \delta$

$$N_r(\alpha + \varepsilon) - N_r(\alpha - \varepsilon) \geq \tfrac{1}{3}r^{\varepsilon-f},$$

so from (17.4) $\underline{f}_C(\alpha) \geq f - \varepsilon$; as this is true for all positive $\varepsilon$ we conclude
$\underline{f}_C(\alpha) \geq f = f_H(\alpha)$. □
Just as many fractals we have encountered have equal box and Hausdorff
dimensions, many common multifractal measures have equal coarse and fine
spectra, that is, equality in (17.19). In the next section, we shall show that this
is the case for self-similar measures.

By setting up Hausdorff-type measures tailored for multifractal purposes, we
can define a quantity $\beta(q)$ playing a similar role in the fine theory to that of (17.7)
in the coarse theory. The Legendre transform of this $\beta(q)$ gives an upper bound
for the fine multifractal spectrum, and again in many cases gives the actual value.

17.3 Self-similar multifractals 

In this section we calculate the fine multifractal spectrum of a self-similar measure
on $\mathbb{R}$, of which Example 17.1 is a specific case. Not only are self-similar measures
important in their own right, but also the methods to be described adapt to many
other classes of measures.

We consider a self-similar measure $\mu$ supported on a self-similar subset $F$
of $\mathbb{R}$. Let $S_1, \ldots, S_m : \mathbb{R} \to \mathbb{R}$ be contracting similarities with ratios $c_1, \ldots, c_m$.
As in Chapter 9, the iterated function system $\{S_1, \ldots, S_m\}$ has a (unique non-empty
compact) attractor $F \subset \mathbb{R}$. We assume that a strong separation condition
holds, that is there is a closed interval $I$ such that $S_i(I) \subset I$ for all $i$, and
$S_i(I) \cap S_j(I) = \emptyset$ whenever $i \neq j$.

As in Chapter 9, we index the intervals in the natural construction of $F$ by
the finite sequences $\mathcal{I}_k = \{(i_1, \ldots, i_k) : 1 \leq i_j \leq m\}$, writing $\mathbf{i} = (i_1, \ldots, i_k)$ for a
typical sequence of $\mathcal{I}_k$. Thus

$$I_{\mathbf{i}} = I_{i_1,\ldots,i_k} = S_{i_1} \circ \cdots \circ S_{i_k}(I). \qquad (17.22)$$

For convenience, we assume that $|I| = 1$, so for $\mathbf{i} = (i_1, i_2, \ldots, i_k) \in \mathcal{I}_k$ we have

$$|I_{\mathbf{i}}| = c_{\mathbf{i}} = c_{i_1} c_{i_2} \cdots c_{i_k}. \qquad (17.23)$$

We define a self-similar measure $\mu$ with support $F$ by repeated subdivision.
Let $p_1, \ldots, p_m$ be 'probabilities' or 'mass ratios', that is $p_i > 0$ for all $i$ and
$\sum_{i=1}^m p_i = 1$. Repeatedly dividing the mass on intervals $I_{i_1,\ldots,i_k}$ between subintervals
$I_{i_1,\ldots,i_k,1}, \ldots, I_{i_1,\ldots,i_k,m}$ in the ratios $p_1 : \cdots : p_m$ defines a measure $\mu$ on
$F$, see Proposition 1.7. Thus

$$\mu(I_{i_1,i_2,\ldots,i_k}) = p_{\mathbf{i}} = p_{i_1} p_{i_2} \cdots p_{i_k}. \qquad (17.24)$$

It is easy to see that $\mu$ is a self-similar measure, in the sense that

$$\mu(A) = \sum_{i=1}^{m} p_i\, \mu(S_i^{-1}(A)) \qquad (17.25)$$

for all sets $A$.
Next, given a real number $q$, we define $\beta = \beta(q)$ as the real number satisfying

$$\sum_{i=1}^{m} p_i^q c_i^{\beta(q)} = 1; \qquad (17.26)$$

since $0 < c_i, p_i < 1$ such a number $\beta(q)$ exists and is unique. Moreover
$\beta : \mathbb{R} \to \mathbb{R}$ is a decreasing function with

$$\lim_{q \to -\infty} \beta(q) = \infty \quad\text{and}\quad \lim_{q \to \infty} \beta(q) = -\infty. \qquad (17.27)$$

Differentiating (17.26) implicitly with respect to $q$ gives

$$0 = \sum_{i=1}^{m} p_i^q c_i^{\beta(q)}\left(\log p_i + \frac{d\beta}{dq}\log c_i\right), \qquad (17.28)$$

and differentiating again gives

$$0 = \sum_{i=1}^{m} p_i^q c_i^{\beta(q)}\left(\frac{d^2\beta}{dq^2}\log c_i + \left(\log p_i + \frac{d\beta}{dq}\log c_i\right)^2\right).$$

Since $\log c_i < 0$ for all $i$, it follows that $d^2\beta/dq^2 \geq 0$, so $\beta$ is convex in $q$. Provided $\log p_i/\log c_i$ is not the same
for all $i = 1, \ldots, m$, then $d^2\beta/dq^2 > 0$ and $\beta$ is strictly convex; we assume
henceforth that this holds to avoid degenerate cases.
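Since (17.26) rarely has a closed-form solution, $\beta(q)$ is usually found numerically. The following is a minimal sketch using bisection, which relies only on the fact that the left-hand side of (17.26) is strictly decreasing in $\beta$. The names `phi` and `beta_of_q`, the bracket-widening strategy and the tolerance are choices made for this illustration, and the sketch is intended for moderate values of $q$ (very large $|q|$ could overflow floating-point arithmetic).

```python
import math

def phi(q, beta, p, c):
    """Left-hand side of (17.26): sum of p_i**q * c_i**beta."""
    return sum(pi ** q * ci ** beta for pi, ci in zip(p, c))

def beta_of_q(q, p, c, tol=1e-13):
    """Solve phi(q, beta) = 1 for beta by bisection.

    phi is strictly decreasing in beta because every 0 < c_i < 1.
    """
    lo, hi = -1.0, 1.0
    while phi(q, lo, p, c) < 1.0:   # widen downwards until phi(lo) >= 1
        lo *= 2.0
    while phi(q, hi, p, c) > 1.0:   # widen upwards until phi(hi) <= 1
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if phi(q, mid, p, c) > 1.0:
            lo = mid                # root lies above mid
        else:
            hi = mid                # root lies at or below mid
    return 0.5 * (lo + hi)
```

When all the ratios equal $\frac{1}{3}$ and $m = 2$, the value returned should agree with the closed form (17.13), which gives a convenient check.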

We will show that the fine (Hausdorff) multifractal spectrum, $f_H(\alpha) = \dim_H F_\alpha$,
is the Legendre transform of $\beta$, where, as before, the Legendre transform $f$ of $\beta$
is defined by

$$f(\alpha) = \inf_{-\infty < q < \infty} \{\beta(q) + \alpha q\}, \qquad (17.29)$$

provided this is finite. Since $\beta : \mathbb{R} \to \mathbb{R}$ is a convex function, there is a range
of $\alpha$, say $\alpha \in [\alpha_{\min}, \alpha_{\max}]$, for which the graph of $\beta$ has a line of support $L_\alpha$ of
slope $-\alpha$, and for such $\alpha$ this support line is unique (thus $-\alpha_{\min}$ and $-\alpha_{\max}$ are
the slopes of the asymptotes of the graph of $\beta$). Then the Legendre transform
of $\beta$ is $f : [\alpha_{\min}, \alpha_{\max}] \to \mathbb{R}$ given by the value of the intercept of $L_\alpha$ with the
vertical axis, again see figure 17.2. It is clear that $f$ is continuous in $\alpha$.

Since $\beta$ is strictly convex, for a given $\alpha$ the infimum in (17.29) is attained at
a unique $q = q(\alpha)$. Equating the derivative of $\beta(q) + \alpha q$ to 0, this occurs when

$$\alpha = -\frac{d\beta}{dq} \qquad (17.30)$$

so that

$$f(\alpha) = \alpha q + \beta(q) = -q\frac{d\beta}{dq} + \beta(q). \qquad (17.31)$$
Note that if any one of $q \in \mathbb{R}$, $\beta \in \mathbb{R}$ and $\alpha \in (\alpha_{\min}, \alpha_{\max})$ is given, the other two
are determined by (17.26), (17.30) and (17.31). Using (17.30) and rearranging
(17.28) gives

$$\alpha = \frac{\sum_{i=1}^{m} p_i^q c_i^{\beta} \log p_i}{\sum_{i=1}^{m} p_i^q c_i^{\beta} \log c_i}. \qquad (17.32)$$

On inspecting this expression, we see that

$$\alpha_{\min} = \min_{1 \leq i \leq m} \log p_i/\log c_i \quad\text{and}\quad \alpha_{\max} = \max_{1 \leq i \leq m} \log p_i/\log c_i \qquad (17.33)$$

which correspond to $q$ approaching $\infty$ and $-\infty$, respectively.

Provided that the numbers $\{\log p_i/\log c_i\}_{i=1}^{m}$ are all different, $f(\alpha_{\min}) = f(\alpha_{\max}) = 0$, see Exercise 17.8.

Differentiating (17.31) with respect to $\alpha$ and using (17.30),

$$\frac{df}{d\alpha} = \alpha\frac{dq}{d\alpha} + q + \frac{d\beta}{dq}\frac{dq}{d\alpha} = q. \qquad (17.34)$$

Since $q$ decreases as $\alpha$ increases, it follows that $f$ is a concave function of $\alpha$.

Some values of $q$ are of special interest, see figure 17.4. If $q = 0$ then $\beta(q) = \dim_H F = \dim_H(\operatorname{spt}\mu)$, comparing (17.26) and the dimension formula for self-similar
sets (9.13). Moreover, by (17.34), $q = 0$ corresponds to the maximum of
$f(\alpha)$; hence, $\dim_H F = \dim_H(\operatorname{spt}\mu) = \max_\alpha f(\alpha)$.

When $q = 1$, (17.26) implies $\beta(q) = 0$ so $f(\alpha) = \alpha$ by (17.31). Moreover,
$df(\alpha)/d\alpha = q = 1$, so that the $f(\alpha)$ curve lies under the line $f = \alpha$ and touches
it just at the point corresponding to $q = 1$. It will follow later that $\alpha(1) = f(\alpha(1)) = \dim_H \mu$, where

$$\dim_H \mu = \inf\{\dim_H E : E \text{ is a Borel set with } \mu(E) > 0\} \qquad (17.35)$$

is the Hausdorff dimension of the measure $\mu$. Thus the dimension of $\mu$ is the
dimension of a set on which a significant part of $\mu$ is concentrated.
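The parametric description of the spectrum and the special values at $q = 0$ and $q = 1$ can be checked numerically. The sketch below solves (17.26) by bisection and then evaluates (17.32) and (17.31); the function names and the fixed search bracket $[-50, 50]$ for $\beta$ are assumptions made for this illustration and are adequate only for moderate $q$.

```python
import math

def solve_beta(q, p, c):
    """beta(q) from (17.26) by bisection; the sum is decreasing in beta."""
    phi = lambda b: sum(pi ** q * ci ** b for pi, ci in zip(p, c))
    lo, hi = -50.0, 50.0            # assumed to bracket beta(q) for moderate q
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if phi(mid) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def alpha_and_f(q, p, c):
    """Return (alpha, f(alpha)) at parameter q, using (17.32) and (17.31)."""
    b = solve_beta(q, p, c)
    w = [pi ** q * ci ** b for pi, ci in zip(p, c)]   # weights, summing to 1 by (17.26)
    alpha = (sum(wi * math.log(pi) for wi, pi in zip(w, p))
             / sum(wi * math.log(ci) for wi, ci in zip(w, c)))
    return alpha, alpha * q + b
```

Sweeping $q$ over an interval and plotting the returned pairs traces out a curve of the shape shown in figure 17.4(b); the point for $q = 1$ lies on the line $f = \alpha$ and the point for $q = 0$ gives the maximum.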

We can now state the main result on the fine multifractal spectrum of a
self-similar measure, that the Hausdorff multifractal spectrum is given by the
Legendre transform (17.29) of $\beta(q)$ given by (17.26).

Theorem 17.4

Let $\mu$ be a self-similar measure as above and let

$$F_\alpha = \{x \in \mathbb{R} : \dim_{\mathrm{loc}}\mu(x) = \alpha\}.$$

If $\alpha \notin [\alpha_{\min}, \alpha_{\max}]$ then $F_\alpha = \emptyset$, and if $\alpha \in [\alpha_{\min}, \alpha_{\max}]$ then

$$f_H(\alpha) = \dim_H F_\alpha = f(\alpha), \qquad (17.36)$$

where $f$ is the Legendre transform of $\beta(q)$.
Figure 17.4 Form of the multifractal functions for a typical self-similar measure. (a) The
$\beta(q)$ curve, with asymptotes of slopes $-\alpha_{\max}$ and $-\alpha_{\min}$, (b) the 'multifractal spectrum' $f(\alpha) = \dim_H F_\alpha$, which is the Legendre
transform of $\beta(q)$



We first give a simple partial proof, and then a slightly more involved full 
proof of this theorem. 

As is usual in this sort of work, it is convenient to redefine $F_\alpha$ in terms of the
component intervals $I_{\mathbf{i}}$ rather than the balls $B(x,r)$. For $x \in \operatorname{spt}\mu$ we write $I_k(x)$
for the $k$th level set $I_{i_1,\ldots,i_k}$ that contains $x$. We shall go back and forth between
the set $I_k(x)$ and the ball $B(x,r)$ where $|I_k(x)|$ is comparable with $r$.

Lemma 17.5

For all $x \in F$

$$\dim_{\mathrm{loc}}\mu(x) = \lim_{r \to 0} \frac{\log\mu(B(x,r))}{\log r} = \lim_{k \to \infty} \frac{\log\mu(I_k(x))}{\log|I_k(x)|}, \qquad (17.37)$$

with either both or neither of these limits existing. In particular

$$F_\alpha = \left\{x \in F : \lim_{k \to \infty} \frac{\log\mu(I_k(x))}{\log|I_k(x)|} = \alpha\right\}. \qquad (17.38)$$

Proof. Let $b = \min_{1 \leq i \leq m} |I_i|$ and let $d$ be the minimum gap between the intervals
$I_1, \ldots, I_m$. Let $x \in F$ and let $0 < r < 1$, and let $k$ be the integer such that

$$|I_k(x)| \leq r < |I_{k-1}(x)| \leq b^{-1}|I_k(x)|.$$

Then $I_k(x) \subset B(x,r)$ and $B(x,dr) \cap F \subset I_k(x)$ since the gap between $I_k(x)$ and
any other $k$th level interval is at least $d|I_{k-1}(x)| > dr$. Then

$$\mu(B(x,dr)) \leq \mu(I_k(x)) \leq \mu(B(x,r)).$$

Hence,

$$\frac{\log\mu(B(x,dr))}{\log(dr) - \log d} = \frac{\log\mu(B(x,dr))}{\log r} \geq \frac{\log\mu(I_k(x))}{\log|I_k(x)|} \geq \frac{\log\mu(B(x,r))}{\log(br)} = \frac{\log\mu(B(x,r))}{\log b + \log r}.$$

Letting $r \to 0$, that is $k \to \infty$, gives the result. □

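We now give a direct proof of the upper bound $\dim_H F_\alpha \leq f(\alpha)$; this is included
as it uses the idea of summing over a restricted class of intervals, a technique
which is used frequently in multifractal theory.

Partial proof of Theorem 17.4. We shall show that $f_H(\alpha) \leq \beta(q) + q\alpha$ for $q > 0$.
The case of $q < 0$ is similar, but with sign changes, see Exercise 17.12, and
taking all values of $q$ together gives $\dim_H F_\alpha \leq f(\alpha)$.

Take $\varepsilon > 0$. Let $Q_k$ denote those $k$th level sequences $\mathbf{i} = (i_1, \ldots, i_k) \in \mathcal{I}_k$
such that

$$|I_{\mathbf{i}}|^{\alpha+\varepsilon} \leq \mu(I_{\mathbf{i}}). \qquad (17.39)$$

Then, since $q > 0$,

$$\sum_{\mathbf{i} \in Q_k} |I_{\mathbf{i}}|^{\beta+q(\alpha+\varepsilon)} \leq \sum_{\mathbf{i} \in Q_k} |I_{\mathbf{i}}|^{\beta}\mu(I_{\mathbf{i}})^q \leq \sum_{\mathbf{i} \in \mathcal{I}_k} |I_{\mathbf{i}}|^{\beta}\mu(I_{\mathbf{i}})^q = \sum_{i_1,\ldots,i_k} (c_{i_1} c_{i_2} \cdots c_{i_k})^{\beta}(p_{i_1} p_{i_2} \cdots p_{i_k})^q = \left(\sum_{i=1}^{m} p_i^q c_i^{\beta}\right)^{k} = 1, \qquad (17.40)$$

using a multinomial expansion and (17.26).
For each integer $K$, write

$$F^K = \{x \in F : \mu(I_k(x)) \geq |I_k(x)|^{\alpha+\varepsilon} \text{ for all } k \geq K\}.$$

Then, for all $k \geq K$, the set $F^K \subset \bigcup_{\mathbf{i} \in Q_k} I_{\mathbf{i}}$ by (17.39), so by (17.40),
$\mathcal{H}^{\beta+q(\alpha+\varepsilon)}_{c^k}(F^K) \leq 1$, since for a $k$th level interval, $|I_{\mathbf{i}}| \leq c^k$ where
$c = \max_{1 \leq i \leq m} c_i$. Letting $k \to \infty$ gives $\mathcal{H}^{\beta+q(\alpha+\varepsilon)}(F^K) \leq 1$, so that
$\dim_H(F^K) \leq \beta + q(\alpha + \varepsilon)$. But $F_\alpha \subset \bigcup_{K=1}^{\infty} F^K$ by (17.38), since if
$\log\mu(I_k(x))/\log|I_k(x)| \to \alpha$ then $\mu(I_k(x)) \geq |I_k(x)|^{\alpha+\varepsilon}$ for all $k$ sufficiently large. Thus
$\dim_H(F_\alpha) \leq \beta + q(\alpha + \varepsilon)$ for all $\varepsilon > 0$, giving $\dim_H(F_\alpha) \leq \beta + q\alpha$. □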
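The identity behind (17.40), and the effect of restricting the sum to the sequences satisfying (17.39), can be checked by brute-force enumeration for a small level $k$. This is only an illustrative sketch: the function names are hypothetical, the enumeration costs $m^k$ steps, and it assumes $\beta$ has already been obtained from (17.26).

```python
import itertools
import math

def level_sum(k, q, beta, p, c):
    """Sum of |I_i|**beta * mu(I_i)**q over all level-k sequences i."""
    total = 0.0
    for seq in itertools.product(range(len(p)), repeat=k):
        length = math.prod(c[i] for i in seq)   # |I_i|, as in (17.23)
        mass = math.prod(p[i] for i in seq)     # mu(I_i), as in (17.24)
        total += length ** beta * mass ** q
    return total

def restricted_sum(k, q, beta, alpha, eps, p, c):
    """Sum of |I_i|**(beta + q*(alpha + eps)) over the sequences satisfying (17.39)."""
    total = 0.0
    for seq in itertools.product(range(len(p)), repeat=k):
        length = math.prod(c[i] for i in seq)
        mass = math.prod(p[i] for i in seq)
        if length ** (alpha + eps) <= mass:     # membership test from (17.39)
            total += length ** (beta + q * (alpha + eps))
    return total
```

With $\beta = \beta(q)$ the full sum equals 1 up to rounding, and for $q > 0$ the restricted sum never exceeds it, whatever values of $\alpha$ and $\varepsilon$ are chosen.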

[*The rest of this section may be omitted.] 

We now embark on a full proof of Theorem 17.4. Writing

$$\Phi(q, \beta) = \sum_{i=1}^{m} p_i^q c_i^{\beta} \qquad (17.41)$$

for $q, \beta \in \mathbb{R}$, $\beta(q)$ is defined by $\Phi(q, \beta(q)) = 1$, see (17.26). We require the
following estimate of $\Phi$ near $(q, \beta(q))$.

Lemma 17.6

For all $\varepsilon > 0$,

$$\Phi(q + \delta, \beta(q) + (-\alpha + \varepsilon)\delta) < 1 \qquad (17.42)$$

and

$$\Phi(q - \delta, \beta(q) + (\alpha + \varepsilon)\delta) < 1 \qquad (17.43)$$

for all sufficiently small $\delta > 0$.

Proof. Recalling that $d\beta/dq = -\alpha$, expansion about $q$ gives

$$\beta(q + \delta) = \beta(q) - \alpha\delta + O(\delta^2) < \beta(q) + (-\alpha + \varepsilon)\delta$$

if $\delta$ is small enough. Since $\Phi(q + \delta, \beta(q + \delta)) = 1$ and $\Phi$ is decreasing in
its second argument, (17.42) follows. Inequality (17.43) is derived in a similar
way. □

To prove Theorem 17.4, for each $\alpha$ we concentrate a measure $\nu$ on $F_\alpha$ and
examine the power law behaviour of $\nu(B(x,r))$ as $r \to 0$, so that we can use
Proposition 4.9 to find the dimension of $F_\alpha$. Given $q \in \mathbb{R}$ and $\beta = \beta(q)$, (17.26)
enables us to define a probability measure $\nu$ on $\operatorname{spt}\mu$ by repeated subdivision
such that

$$\nu(I_{i_1,\ldots,i_k}) = (p_{i_1} p_{i_2} \cdots p_{i_k})^q (c_{i_1} c_{i_2} \cdots c_{i_k})^{\beta}, \qquad (17.44)$$

for each $(i_1, \ldots, i_k)$, and extending this to a measure on $F$ in the usual way.
Together with (17.23) and (17.24) this gives three ways of quantifying the $I_{\mathbf{i}}$:

$$|I_{\mathbf{i}}| = c_{\mathbf{i}}, \qquad \mu(I_{\mathbf{i}}) = p_{\mathbf{i}}, \qquad \nu(I_{\mathbf{i}}) = p_{\mathbf{i}}^q c_{\mathbf{i}}^{\beta} \qquad (17.45)$$

for all $\mathbf{i} = (i_1, \ldots, i_k)$.

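The following proposition contains the crux of the multifractal spectrum calculation.

Proposition 17.7

With $q$, $\beta$, $\alpha$ and $f$ as above, and with $\nu$ determined by (17.44),

(a) $\nu(F_\alpha) = 1$,

(b) for all $x \in F_\alpha$ we have $\log\nu(B(x,r))/\log r \to f(\alpha)$ as $r \to 0$.

Proof. Let $\varepsilon > 0$ be given. Then for all $\delta > 0$

$$\begin{aligned}
\nu\{x : \mu(I_k(x)) \geq |I_k(x)|^{\alpha-\varepsilon}\} &= \nu\{x : 1 \leq \mu(I_k(x))^{\delta}|I_k(x)|^{(\varepsilon-\alpha)\delta}\} \\
&\leq \int \mu(I_k(x))^{\delta}|I_k(x)|^{(\varepsilon-\alpha)\delta}\,d\nu(x) \\
&= \sum_{\mathbf{i} \in \mathcal{I}_k} \mu(I_{\mathbf{i}})^{\delta}|I_{\mathbf{i}}|^{(\varepsilon-\alpha)\delta}\nu(I_{\mathbf{i}}) \\
&= \sum_{\mathbf{i} \in \mathcal{I}_k} p_{\mathbf{i}}^{q+\delta} c_{\mathbf{i}}^{\beta+(\varepsilon-\alpha)\delta} \\
&= \left[\sum_{i=1}^{m} p_i^{q+\delta} c_i^{\beta+(\varepsilon-\alpha)\delta}\right]^{k} \\
&= [\Phi(q + \delta, \beta + (\varepsilon - \alpha)\delta)]^{k},
\end{aligned}$$

where $\Phi$ is given by (17.41), using (17.45) and a multinomial expansion. Choosing
$\delta$ small enough and using (17.42) gives that

$$\nu\{x : \mu(I_k(x)) \geq |I_k(x)|^{\alpha-\varepsilon}\} \leq \gamma^k \qquad (17.46)$$

where $\gamma < 1$ is independent of $k$. Thus

$$\nu\{x : \mu(I_k(x)) \geq |I_k(x)|^{\alpha-\varepsilon} \text{ for some } k \geq K\} \leq \sum_{k=K}^{\infty} \gamma^k \leq \gamma^K/(1 - \gamma).$$

It follows that for $\nu$-almost all $x$ we have

$$\varliminf_{k \to \infty} \frac{\log\mu(I_k(x))}{\log|I_k(x)|} \geq \alpha - \varepsilon.$$

This is true for all $\varepsilon > 0$, so we get the left-hand inequality of

$$\alpha \leq \varliminf_{k \to \infty} \frac{\log\mu(I_k(x))}{\log|I_k(x)|} \leq \varlimsup_{k \to \infty} \frac{\log\mu(I_k(x))}{\log|I_k(x)|} \leq \alpha.$$

The right-hand inequality follows in the same way, using (17.43) in estimating
$\nu\{x : \mu(I_k(x)) \leq |I_k(x)|^{\alpha+\varepsilon}\}$. From (17.37) we conclude that for $\nu$-almost all $x$

$$\dim_{\mathrm{loc}}\mu(x) = \lim_{k \to \infty} \frac{\log\mu(I_k(x))}{\log|I_k(x)|} = \alpha$$

and it follows that $\nu(F_\alpha) = \nu(F) = 1$.
For (b) note that from (17.45)

$$\frac{\log\nu(I_k(x))}{\log|I_k(x)|} = q\,\frac{\log\mu(I_k(x))}{\log|I_k(x)|} + \beta\,\frac{\log|I_k(x)|}{\log|I_k(x)|}, \qquad (17.47)$$

so for all $x \in F_\alpha$

$$\frac{\log\nu(I_k(x))}{\log|I_k(x)|} \to q\alpha + \beta = f \qquad (17.48)$$

as $k \to \infty$, using (17.31). Part (b) follows, applying (17.37) with $\nu$ replacing
$\mu$. □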
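Part (a) says that a point chosen at random according to $\nu$ almost surely has local dimension $\alpha$ with respect to $\mu$. A hedged way to see this numerically is to sample the code of such a point: under $\nu$, as defined by (17.44), each symbol $i$ is chosen independently with probability $p_i^q c_i^{\beta}$. The sketch below is an illustration only; the function name, the sample length and the use of Python's `random` module are choices made here, and a finite sample can only approximate the limit in (17.38).

```python
import math
import random

def sample_ratio(q, beta, p, c, k, rng):
    """Draw the first k code symbols under nu; return log mu(I_k) / log |I_k|."""
    # nu divides mass between subintervals in the ratios p_i**q * c_i**beta
    weights = [pi ** q * ci ** beta for pi, ci in zip(p, c)]
    symbols = rng.choices(range(len(p)), weights=weights, k=k)
    log_mass = sum(math.log(p[i]) for i in symbols)      # log mu(I_k(x)), by (17.24)
    log_length = sum(math.log(c[i]) for i in symbols)    # log |I_k(x)|, by (17.23)
    return log_mass / log_length
```

For large $k$ the returned ratio should be close to the value of $\alpha$ given by (17.32) for the same $q$, up to sampling fluctuations of order $k^{-1/2}$.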

Our main Theorem 17.4 on the multifractal spectrum of self-similar measures 
now follows easily. 

Proof of Theorem 17.4. From (17.45)

$$\frac{\log\mu(I_{\mathbf{i}})}{\log|I_{\mathbf{i}}|} = \frac{\sum_{j=1}^{k} \log p_{i_j}}{\sum_{j=1}^{k} \log c_{i_j}}$$

where $\mathbf{i} = (i_1, \ldots, i_k)$, so by (17.33) $\log\mu(I_{\mathbf{i}})/\log|I_{\mathbf{i}}| \in [\alpha_{\min}, \alpha_{\max}]$ for all $\mathbf{i}$.
Thus the only possible limit points of $\log\mu(I_{\mathbf{i}})/\log|I_{\mathbf{i}}|$, and so by (17.37) of
$\log\mu(B(x,r))/\log r$, are in $[\alpha_{\min}, \alpha_{\max}]$. In particular, $F_\alpha = \emptyset$ if $\alpha \notin [\alpha_{\min}, \alpha_{\max}]$.

If $\alpha \in (\alpha_{\min}, \alpha_{\max})$ then by Proposition 17.7 there exists a measure $\nu$ concentrated
on $F_\alpha$ with $\lim_{r \to 0} \log\nu(B(x,r))/\log r = f(\alpha)$ for all $x \in F_\alpha$, so (17.36)
follows from Proposition 4.9. For the cases $\alpha = \alpha_{\min}$ and $\alpha = \alpha_{\max}$, see Exercise
17.14. □

Thus, for a self-similar measure, the dimension of $F_\alpha$ may be calculated by
taking the Legendre transform of $\beta(q)$ given by (17.26).

The dimensions of $\operatorname{spt}\mu$ and of the measure $\mu$, see (17.35), may easily be
found from the multifractal spectrum.

Proposition 17.8

Let $\mu$ be a self-similar measure as above. Regarding $\alpha = \alpha(q)$ as a function of $q$,

(a) $f(\alpha)$ takes its maximum when $\alpha = \alpha(0)$, with $f(\alpha(0)) = \dim_H \operatorname{spt}\mu$.

(b) $f(\alpha(1)) = \alpha(1) = \dim_H \mu$.

Proof. Part (a) and that $f(\alpha(1)) = \alpha(1)$ were noted as a consequence of (17.34).
For the dimension of the measure $\mu$, if $q = 1$ then $\beta = 0$ from (17.26), so by
(17.44) the measure $\nu$ is identical to $\mu$. By Proposition 17.7 $\mu(F_{\alpha(1)}) = 1$ and
$\dim_{\mathrm{loc}}\mu(x) = f(\alpha(1))$ for all $x \in F_{\alpha(1)}$, so by Proposition 4.9, $\dim_H E = f(\alpha(1))$
for all $E$ with $\mu(E) > 0$. Thus (b) follows from the definition of the dimension
of a measure (17.35). □

Next we show that the coarse spectrum of a self-similar measure is also equal
to $f(\alpha)$, the Legendre transform of $\beta$.

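Proposition 17.9

Let $\mu$ be a self-similar measure on $\mathbb{R}$ as above. Then

$$\underline{f}_C(\alpha) = \overline{f}_C(\alpha) = f_H(\alpha) = f(\alpha) \qquad (17.49)$$

for all $\alpha = \alpha(q)$ for which $q \geq 0$.

Proof. We first note that by Theorem 17.4 and Lemma 17.3 we have
$f(\alpha) = f_H(\alpha) \leq \underline{f}_C(\alpha) \leq \overline{f}_C(\alpha)$, where the coarse spectra are given by (17.4) and (17.5).
To prove the opposite inequality, let $d$ be the minimum separation of $I_i$ and $I_j$
for $i \neq j$, and write $a = 2/d$. Given $r < a^{-1}$, let $Q$ be the set of all sequences
$\mathbf{i} = (i_1, \ldots, i_k) \in \mathcal{I}_k$ such that $|I_{i_1,\ldots,i_k}| \leq ar$ but $|I_{i_1,\ldots,i_{k-1}}| > ar$. Then

$$abr < |I_{\mathbf{i}}| = c_{\mathbf{i}} \leq ar \qquad (17.50)$$

if $\mathbf{i} \in Q$, where $b = \min_{1 \leq i \leq m} c_i$. We note that each point of $F$ lies in exactly
one set $I_{\mathbf{i}}$ with $\mathbf{i} \in Q$, and also that for distinct $\mathbf{i}, \mathbf{j} \in Q$ the sets $I_{\mathbf{i}}$ and $I_{\mathbf{j}}$ have
separation at least $dar = 2r$.

Suppose $q > 0$ and let $\beta$, $\alpha$ and $f$ be the corresponding values given by
(17.26), (17.30) and (17.31). Then

$$\begin{aligned}
\#\{\mathbf{i} \in Q : \mu(I_{\mathbf{i}}) \geq a^{-\alpha}|I_{\mathbf{i}}|^{\alpha}\} &= \#\{\mathbf{i} \in Q : 1 \leq (a^{\alpha}|I_{\mathbf{i}}|^{-\alpha}\mu(I_{\mathbf{i}}))^q\} \\
&= \#\{\mathbf{i} \in Q : 1 \leq a^{\alpha q} p_{\mathbf{i}}^q c_{\mathbf{i}}^{-\alpha q}\} \\
&\leq a^{\alpha q} \sum_{\mathbf{i} \in Q} p_{\mathbf{i}}^q c_{\mathbf{i}}^{-\alpha q} \\
&\leq a^{\alpha q}(abr)^{-\beta-\alpha q} \sum_{\mathbf{i} \in Q} p_{\mathbf{i}}^q c_{\mathbf{i}}^{\beta} \\
&= a^{\alpha q}(ab)^{-f} r^{-f},
\end{aligned}$$

using (17.50), (17.31) and that $\sum_{\mathbf{i} \in Q} p_{\mathbf{i}}^q c_{\mathbf{i}}^{\beta} = 1$, an identity that follows by repeated
substitution of $\sum_{i=1}^{m} p_{\mathbf{i},i}^q c_{\mathbf{i},i}^{\beta} = p_{\mathbf{i}}^q c_{\mathbf{i}}^{\beta}$ in itself. Every $r$-mesh interval intersects
at most one of the sets $I_{\mathbf{i}}$ for $\mathbf{i} \in Q$. With $N_r(\alpha)$ as in (17.1)

$$\begin{aligned}
N_r(\alpha) &= \#\{r\text{-mesh intervals } C : \mu(C) \geq r^{\alpha}\} \\
&\leq \#\{\mathbf{i} \in Q : \mu(I_{\mathbf{i}}) \geq a^{-\alpha}|I_{\mathbf{i}}|^{\alpha}\} \\
&\leq a^{\alpha q}(ab)^{-f(\alpha)} r^{-f(\alpha)},
\end{aligned}$$

using (17.50). It follows that there is a number $c$ such that for sufficiently small
$\varepsilon$ and $r$

$$N_r(\alpha + \varepsilon) - N_r(\alpha - \varepsilon) \leq N_r(\alpha + \varepsilon) \leq c\,r^{-f(\alpha+\varepsilon)},$$

so, by (17.5), $\overline{f}_C(\alpha) \leq f(\alpha + \varepsilon)$ for all $\varepsilon > 0$; since $f$ is continuous $\overline{f}_C(\alpha) \leq f(\alpha)$. □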

The coarse spectrum, as we have defined it, is not well-enough behaved to
give equality in (17.49) for $\alpha$ corresponding to $q < 0$.

Continuation of Example 17.1. Proposition 17.9 allows us to complete our
analysis of this example. With $\beta(q) = \log(p_1^q + p_2^q)/\log 3$, the coarse and fine
multifractal spectra of $\mu$ are equal to the Legendre transform $f(\alpha)$ of $\beta(q)$, that
is $f_C(\alpha) = f_H(\alpha) = f(\alpha)$ for $\alpha \in [\alpha_{\min}, \alpha_{\max}]$, where $\alpha_{\min} = \min_{i=1,2} \log p_i/(-\log 3)$
and $\alpha_{\max} = \max_{i=1,2} \log p_i/(-\log 3)$. As before, this leads to the
parametric expression for $f$ given in (17.14).

These methods generalize to many further multifractal measures. In particular,
the derivation of the multifractal spectrum for self-similar measures extends to
self-similar measures on $\mathbb{R}^n$ without difficulty, and the separation condition on
the intervals $S_i(I)$ can be weakened to the open set condition.



17.4 Notes and references 

A great deal has been written on multifractals, and multifractal spectra have been 
calculated for many specific measures. We mention a small selection of references 
where further details may be found. 

The idea of studying measures from a fractal viewpoint is implicit in Mandel- 
brot's essay (1982); see also Mandelbrot (1974). Legendre transformation was 
introduced into multifractal analysis in Frisch and Parisi (1985) and Halsey et al. 
(1986). Various approaches at a fairly basic level are given by Evertsz and Man- 
delbrot (1992), Falconer (1997), Feder (1988), McCauley (1993) and Tel (1988). 
Mandelbrot's 'Selecta' (1999) includes reprints of many papers, as well as a wide- 
ranging survey and a comprehensive bibliography on multifractals. The book by 
Harte (2001) gives a substantial treatment of the theory with many references, 
and in particular addresses statistical estimation of spectra. 

Measure theoretic approaches to multifractal theory, including use of measures 
of Hausdorff type and the relationships between different types of spectra, are 
given by Brown, Michon and Peyriere (1992), Olsen (1995) and Pesin (1997). 
A careful treatment of the coarse theory is given by Riedi (1995). 

Self-similar measures are analysed by Cawley and Mauldin (1992), and Edgar 
and Mauldin (1992) and Olsen (1994) extend this to graph-directed constructions
of measures. For self-similar multifractals with infinitely many similarities,
see Mandelbrot and Riedi (1995) and Mauldin and Urbański (1996). Extensions
to random self-similar measures are considered by Mandelbrot (1974), Kahane 
and Peyriere (1976) and Olsen (1994), to self-conformal measures and weaker 
separation conditions by Patzschke (1997), and to vector-valued multifractals 
by Falconer and O'Neil (1996). McCauley (1993), Simpelaere (1994) and Pesin 
(1997) consider multifractal measures that occur in dynamical systems. Jaffard 
(1997) discusses the local form of functions from a multifractal viewpoint. 

There are many other interesting aspects of multifractal behaviour, such as the 
interpretation of negative dimensions, see Mandelbrot (1991), and the geometrical 
properties of multifractals, such as their behaviour under projections, intersections 
or products, see Olsen (2000). 



Exercises 

17.1 Find the Legendre transform of $\beta(q) = e^{-q}$.

17.2 Let $\mu_1, \mu_2$ be finite measures on $\mathbb{R}^n$ with disjoint supports, and define $\nu = \mu_1 + \mu_2$.
Show that $f_H^{\nu}(\alpha) = \max\{f_H^{1}(\alpha), f_H^{2}(\alpha)\}$ where $f_H^{\nu}$, $f_H^{1}$ and $f_H^{2}$ are the Hausdorff
spectra of $\nu$, $\mu_1$ and $\mu_2$. Deduce that $f_H^{\nu}(\alpha)$ need not be concave over the range of
$\alpha$ for which it does not vanish.

17.3 Let $\mu$ be a finite measure on $\mathbb{R}^n$ and let $g : \mathbb{R}^n \to \mathbb{R}^n$ be a bi-Lipschitz function.
Define the image measure $\nu$ on $\mathbb{R}^n$ by $\nu(A) = \mu(g^{-1}(A))$. Show that the local
dimensions $\dim_{\mathrm{loc}}\mu(x)$ and $\dim_{\mathrm{loc}}\nu(g(x))$ are equal when either exists. Hence show
that $f_H(\alpha)$ given by (17.17) is the same for both $\nu$ and $\mu$.

17.4 Let $\mu$ be a self-similar measure supported by the 'middle half' Cantor set, so $c_1 = c_2 = \frac{1}{4}$,
with the mass repeatedly divided between the left and right parts in the ratio
$p_1 : p_2$, where $p_1 + p_2 = 1$ and $p_1 < p_2$. Show that $\beta(q) = \log(p_1^q + p_2^q)/\log 4$,
and hence find $\alpha$ and $f$ in terms of the parameter $q$.

17.5 Let $\mu$ be the measure on the middle half Cantor set of Exercise 17.4. For $q > 0$
estimate the moment sums (17.6). [Hint: consider the case where $r = 4^{-k}$ first.]
Hence show that $\beta(q)$ given by (17.7) coincides with that stated in
Exercise 17.4.
17.6 Let $\mu$ be a self-similar measure constructed by repeated subdivision of the support
in the ratios $c_1 = \frac{1}{2}$ and $c_2 = \frac{1}{4}$ and of the measure in ratios $p_1$ and $p_2$. Obtain an
explicit formula for $\beta(q)$.

17.7 Let $\mu$ be the measure of Example 17.1, with $\beta$ given by (17.13). Show that
$\beta(q) - \beta(-q) = q\log(p_1 p_2)/\log 3$. Hence show the multifractal spectrum $f(\alpha)$ given by
the Legendre transform (17.10) is symmetrical about $\alpha = -\log(p_1 p_2)/2\log 3$, that
is $f(\alpha) = f(-\log(p_1 p_2)/\log 3 - \alpha)$.

17.8 Let $\mu$ be the measure of Example 17.1, with $0 < p_1 < p_2 < 1$ and $\beta$ given by
(17.13). Show that for large $q$, $\beta(q) = q\log p_2/\log 3 + o(1)$ and obtain a similar
expression for $q$ large negative. (Here $o(1)$ means a function of $q$ that tends to 0 as
$q \to \infty$.) Deduce that the asymptotes of the graph of $\beta(q)$ pass through the origin,
that $\alpha_{\min} = -\log p_2/\log 3$, $\alpha_{\max} = -\log p_1/\log 3$ and that $f(\alpha_{\min}) = f(\alpha_{\max}) = 0$.

17.9 For the self-similar measures discussed in Section 17.3, show that $d^2 f/d\alpha^2 < 0$
for $\alpha_{\min} < \alpha < \alpha_{\max}$; thus the $f(\alpha)$ curve is strictly concave.

17.10 With $\beta$ defined by (17.26), show that $\beta(q) \leq 1 - q$ if $0 < q \leq 1$ and
$\beta(q) \geq 1 - q$ if $q > 1$. (Hint: recall Hölder's inequality:
$\sum_{i=1}^{m} a_i b_i \leq \left(\sum_{i=1}^{m} a_i^{p}\right)^{1/p}\left(\sum_{i=1}^{m} b_i^{p'}\right)^{1/p'}$, where $1 < p < \infty$ and $1/p + 1/p' = 1$.)

17.11 Let $\mu$ be a finite measure on $\mathbb{R}^2$ and let proj denote projection onto a given
line $L$. Define the projection of $\mu$ onto $L$ by $(\mathrm{proj}\,\mu)(A) = \mu\{x \in \mathbb{R}^2 : \mathrm{proj}\,x \in A\}$
for $A \subset L$. For $x \in \mathbb{R}^2$, show that $\lim_{r \to 0} \log((\mathrm{proj}\,\mu)(B_L(\mathrm{proj}\,x, r)))/\log r \leq
\lim_{r \to 0} \log\mu(B(x,r))/\log r$, where $B_L(y,r)$ is the interval within $L$ of centre $y$
and length $2r$.

17.12 Adapt the 'Partial proof of Theorem 17.4' to show that $\dim_H F_\alpha \leq \beta(q) + q\alpha$ for
the case of $\alpha$ corresponding to $q < 0$. (Hint: consider $\sum |I|^{\beta+q(\alpha-\varepsilon)}$ for intervals
$I$ with $\mu(I) \leq |I|^{\alpha-\varepsilon}$.)

17.13 Extend the 'Partial proof of Theorem 17.4' to show that, for $\alpha$ corresponding to
$q > 0$,

$$\dim_H\{x \in F : \lim_{k \to \infty} \log\mu(I_k(x))/\log|I_k(x)| \leq \alpha\} \leq f(\alpha).$$

17.14 Prove (17.36) when $\alpha = \alpha_{\min}$. (Hint: take $\alpha$ close to $\alpha_{\min}$, and note that Proposition
17.7 remains true with (b) replaced by 'for all $x$ such that $\log\mu(B(x,r))/\log r \leq \alpha$
we have $\lim_{r \to 0} \log\nu(B(x,r))/\log r \leq f(\alpha)$'.)



Chapter 18 Physical applications 



Cloud boundaries, mountain skylines, coastlines, forked lightning, . . .; these, and 
many other natural objects have a form much better described in fractal terms than 
by the straight lines and smooth curves of classical geometry. Fractal mathematics 
ought, therefore, to be well suited to modelling and making predictions about 
such phenomena. 

There are, however, considerable difficulties in applying the mathematics of
fractal geometry to real-life examples. We might estimate the box dimension of,
say, the coastline of Britain by counting the number $N_\delta$ of mesh squares of side $\delta$
intersected by the coastline. For a range of $\delta$ between 20 m and 200 km the graph
of $\log N_\delta$ against $-\log\delta$ is closely matched by a straight line of slope about 1.2.
Thus the power law $N_\delta \simeq \text{constant} \times \delta^{-1.2}$ is valid for such $\delta$ and it makes sense
to say that the coastline has dimension 1.2 over this range of scales. However,
as $\delta$ gets smaller, this power law first becomes inaccurate and then meaningless.
Similarly, with other physical examples, estimates of dimension using boxes of
side $\delta$ inevitably break down well before a molecular scale is reached.
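The log-log fitting procedure described above can be sketched in a few lines. Since no coastline data are given here, the sketch uses the middle third Cantor set as a stand-in data set, with points stored as exact integers to avoid rounding problems at box boundaries; the function names and the choice of scales are assumptions made for this illustration. With measured data one would supply the observed counts $N_\delta$ and fit only over the range of scales where the plot is close to linear.

```python
import math

def cantor_numerators(level):
    """Integers n such that n / 3**level are the left endpoints of the
    level-`level` intervals of the middle third Cantor set."""
    nums = [0]
    for k in range(1, level + 1):
        nums = nums + [n + 2 * 3 ** (level - k) for n in nums]
    return nums

def box_count(nums, level, k):
    """Number of mesh intervals of side 3**-k that contain a point (exact arithmetic)."""
    return len({n // 3 ** (level - k) for n in nums})

def fitted_slope(xs, ys):
    """Least-squares slope of ys against xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den

def estimated_dimension(level=10, scales=range(1, 9)):
    """Slope of log N_delta against -log delta for delta = 3**-k, k in scales."""
    nums = cantor_numerators(level)
    xs = [k * math.log(3) for k in scales]                        # -log delta
    ys = [math.log(box_count(nums, level, k)) for k in scales]    # log N_delta
    return fitted_slope(xs, ys)
```

For this exactly self-similar example the fitted slope recovers $\log 2/\log 3$; for a physical object the same fit would be expected to hold only over a limited range of $\delta$.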

The theory of fractals studied in Part I of this book depends on taking limits
as $\delta \to 0$, which cannot be achieved in reality. There are no true fractals
in nature; for that matter, there are no inextensible strings or frictionless pulleys
either!

Nevertheless, it should be possible to apply the mathematical theory of 'exact' 
fractals to the 'approximate' fractals of nature, and this has been achieved con- 
vincingly in many situations. This is analogous to the well established use of 
classical geometry in science — for example, regarding the earth as spherical 
provides a good enough approximation for many calculations involving its grav- 
itational field. 

Perhaps the most convincing example of a physical phenomenon with a fractal 
model is that of Brownian motion; see Chapter 16. The underlying physical 
assumption, that a particle subject to random molecular bombardment moves with 
increments distributed according to a normal distribution, leads to the conclusion 
that the particle trail has dimension 2. This can be checked experimentally using 
box-counting methods. The motion can also be simulated on a computer, by 
plotting a trail formed by a large number of small random increments. The 
dimension of such computer realizations can also be estimated by box counting. 
Brownian motion, which may be observed in reality or on a computer, has a
fractal form predicted by a theoretical model. (It should, perhaps, be pointed
out that even Brownian paths fail to be described by fractals on a very small
scale, since infinite energy would be required for a particle to follow a nowhere-differentiable
path of dimension 2.) Linking up experiment, simulation and theory
must surely be the aim with other physical manifestations of fractals.

The study of fractals in nature thus proceeds on these three fronts: experiment, 
simulation and theory. Physical objects are observed and measured, dimensions 
and, perhaps, multifractal spectra are estimated over an appropriate range of 
scales, and their dependence on various parameters noted. Theoretical techniques, 
such as assuming the Projection theorem 6.1 to estimate the dimension of an 
object from a photograph, are sometimes used here. Of course, for a dimension 
to have any significance, repeating an experiment must lead to the same value. 

Whilst a dimension may have some interest purely as a physical constant, it 
is much more satisfying if fractal properties can be explained in physical terms. 
Therefore, the next stage is to devise some sort of mechanism to explain the 
natural phenomena. Computational simulation then permits evaluation of various 
models by comparing characteristics, such as dimension, of the simulation with those
observed in reality. Computational methods are always approximate; this can actually be an
advantage when modelling natural rather than exact fractals in that very small- 
scale effects will be neglected. 

It is desirable to have a theoretical model that is mathematically manage- 
able, with basic physical features, such as the apparent dimension, derivable 
from a mathematical argument. The model should account for the dependence of 
these features on the various parameters, and, ideally, be predictive as well as 
descriptive. Fractal phenomena in nature are often rather complicated to describe, 
and various assumptions and approximations may be required in setting up and 
analysing a mathematical model. Of course, the ability to do this in a way that pre- 
serves the physical content is the mark of a good theoretical scientist! Sometimes 
differential equations may describe a physical situation, and fractal attractors can 
often result; see Section 13.5. On the other hand, analysis of differential equations 
where the boundary or domain is fractal can present problems of an entirely 
different nature. 

There is a vast literature devoted to examining fractal phenomena in these 
ways; often agreement of dimension between experiment, simulation and theory 
is surprisingly good. Moreover, analysis of dimension has been used effectively to 
isolate the dominant features underlying certain physical processes. Nevertheless, 
there is still a long way to go. Questions such as 'Why do projections of clouds 
have perimeters of dimension 1.35 over a very wide range of scales?', 'How 
does the dimension of the surface of a metal affect the physical properties such 
as radiation of heat or the coefficient of friction?' and 'What are the geological 
processes that lead to a landscape of dimension 2.2?' should be answered in the 
framework of fractal modelling. 

For most experimental purposes, box-counting dimension has to be used. With
$N_\delta$ defined by one of the Equivalent Definitions 3.1, the dimension of an object
is usually found by estimating the gradient of a graph of $\log N_\delta$ against $-\log \delta$.
We often wish to estimate the dimension of a theoretical exact fractal F by
box counting on a physical approximation E. To do this, boxes that are large
compared with the accuracy of the approximation must be used. More precisely,
if $d(E, F) \le \varepsilon$ where d denotes Hausdorff distance, and $N_\delta(E)$ and $N_\delta(F)$ are
the numbers of balls of radius $\delta$ required to cover the sets, it is easy to see that

$$N_{\delta+\varepsilon}(E) \le N_\delta(F) \le N_{\delta-\varepsilon}(E)$$

for $\delta > \varepsilon$, and this may be taken into account when estimating $\dim_H F$ from
log-log plots of measurements of E. It is also worth remembering, as we
have indicated in various instances throughout the book, that there are often
theoretical reasons for supposing that exact fractals have equal box and
Hausdorff dimensions.
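The gradient estimate described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the set is given as a finite list of points in the plane, the mesh-square version of the box count is used, and the test data (chaos-game points on the Sierpiński triangle) and all function names are hypothetical choices made here, not taken from the text.

```python
import math
import random

def box_count(points, delta):
    """Number of mesh squares of side delta containing at least one point."""
    return len({(math.floor(x / delta), math.floor(y / delta)) for x, y in points})

def box_dimension(points, deltas):
    """Least-squares gradient of log N_delta against -log delta."""
    xs = [-math.log(d) for d in deltas]
    ys = [math.log(box_count(points, d)) for d in deltas]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx

def sierpinski_points(n, seed=0):
    """Illustrative test data: chaos-game points on the Sierpinski triangle."""
    rng = random.Random(seed)
    verts = [(0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2)]
    x, y = 0.0, 0.0
    pts = []
    for _ in range(n):
        vx, vy = rng.choice(verts)
        x, y = (x + vx) / 2, (y + vy) / 2  # move halfway to a random vertex
        pts.append((x, y))
    return pts

# Box sides well above the resolution of the point cloud, as the text advises.
deltas = [2.0 ** -k for k in range(2, 7)]
estimate = box_dimension(sierpinski_points(50000), deltas)
```

The estimate should lie in the neighbourhood of log 3 / log 2 (about 1.585), though the exact value depends on the range of box sizes and on how the mesh sits relative to the set.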

Sometimes other quantities are more convenient to measure than dimension. 
For example, in the case of a time-dependent signal, the autocorrelation func- 
tion (see Section 11.2) might be measured, with equation (11.20) providing an 
indication of the dimension. 

We now examine in more detail some specific physical examples where fractal 
analysis can aid understanding of physical processes. 



18.1 Fractal growth 

Many natural objects grow in an apparently fractal form, with branches repeatedly 
splitting and begetting smaller side branches. When viewed at appropriate scales, 
certain trees, root systems and plants (in particular more primitive ones such as 
lichens, mosses and seaweeds) appear as fractals. Forked patterns of lightning 
or other electrical discharges, and the 'viscous fingering' that occurs when water 
is injected into a viscous liquid such as oil also have a branched fractal form. 
During electrolysis of copper sulphate solution, the copper deposit at the cathode 
grows in a fractal pattern. 

The biological laws that govern plant growth are far too complex to be used as 
a basis for a mathematical model. However, other phenomena may be modelled 
by relatively simple growth laws and we examine some of these. 

A simple experiment demonstrates fractal growth by electrolysis of copper 
sulphate (CuSO₄); see figure 18.1. The bottom of a circular dish is covered with
a little copper sulphate solution. A copper cathode is suspended in the centre of 
the dish and a strip of copper is curved around the edge of the dish to form an 
anode. If a potential of a few volts is applied between the electrodes, then, after 
a few minutes, a deposit of copper starts to form around the cathode. After half 
an hour or so the copper deposit will have extended into fractal fingers several 
inches long. 

Figure 18.1 Electrolysis of copper sulphate leads to fractal-like deposits of copper
growing outwards from the cathode. (Labels in the figure: copper sulphate solution; copper anode.)

The mechanism for this electrolysis is as follows. In solution, the copper
sulphate splits into copper Cu²⁺ ions and sulphate SO₄²⁻ ions which drift around
in a random manner. When the voltage is applied, the copper ions that hit the
cathode receive two electrons, and are deposited as copper. Copper ions that hit 
any copper already formed are also deposited as copper, so the residue grows 
away from the cathode. Assuming that the copper ions move in a sufficiently ran- 
dom manner, for example following Brownian paths (see Chapter 16), ions are 
more likely to hit the exposed finger ends than the earlier parts of the deposition 
which tend to be 'protected' by subsequent growth. Thus it is at least plausible 
that growth of the copper deposit will be in thin, branching fingers rather than 
in a solid 'block' around the cathode. 

In the experiment described, the Cu²⁺ ions follow a Brownian path with a drift
towards the cathode superimposed as a result of the electric field between cathode
and anode. Enriching the sulphate in the solution, for example, by addition of
sodium sulphate, screens the copper ions from the electric field. Fractal deposits
still occur, but this situation is more convenient for mathematical modelling since
the Cu²⁺ ions may be assumed to follow Brownian paths. A similar process
occurs in electrolysis of zinc sulphate (ZnSO₄) with a zinc anode and carbon
cathode, with fingers of zinc growing out from the cathode. 

The diffusion-limited aggregation (DLA) model provides a convincing simu- 
lation of the growth. The model is based on a lattice of small squares. An initial 
square is shaded to represent the cathode, and a large circle is drawn centred on 
this square. A particle is released from a random point on the perimeter of the 
circle, and allowed to follow a Brownian path until it either leaves the circle, or 
reaches a square neighbouring a shaded one, in which case that square is also 
shaded. As this process is repeated a large number of times, a connected set of 
squares grows outward from the initial one. It is computationally more conve- 
nient to let the particle follow a random walk (which gives an approximation to 
a Brownian path), so when the particle is released, it moves, with probability 1/4
each, left, right, up or down to a neighbouring square, continuing until it leaves the
circle or occupies a square next to a shaded one; see figure 18.2. (There are ways
of shortening the computation required; for example, if the particle is k squares
away from the shaded part the particle might as well move k steps at once.)

Figure 18.2 The diffusion-limited aggregation (DLA) model. A particle is released from
a random point a on the circle and performs a random walk until it either leaves the circle
or reaches a square b next to one that has already been shaded, in which case this square
is also shaded
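The square-lattice procedure just described can be sketched as follows. This is a minimal illustration, not an efficient implementation: the function name, the choice to release particles one lattice step inside the circle, and the rule of stopping early once the cluster comes within three squares of the circle are all assumptions made here for convenience.

```python
import math
import random

def dla(n_particles, radius, seed=0):
    """Grow a square-lattice DLA cluster around the seed square (0, 0).

    Each particle is released just inside a circle of the given radius and
    performs a simple random walk until it leaves the circle (and is
    discarded) or occupies a square next to a shaded one (and is shaded).
    """
    rng = random.Random(seed)
    steps = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    shaded = {(0, 0)}
    reach = 0  # largest squared distance of a shaded square from the seed

    def touches(x, y):
        # True if (x, y) has a shaded neighbour to the left, right, above or below.
        return any((x + dx, y + dy) in shaded for dx, dy in steps)

    # Stop when enough squares are shaded or the cluster nears the circle.
    while len(shaded) < n_particles + 1 and reach < (radius - 3) ** 2:
        theta = rng.uniform(0.0, 2.0 * math.pi)
        x = round((radius - 1) * math.cos(theta))
        y = round((radius - 1) * math.sin(theta))
        while x * x + y * y <= radius * radius:
            if touches(x, y):
                shaded.add((x, y))
                reach = max(reach, x * x + y * y)
                break
            dx, dy = rng.choice(steps)  # each direction with probability 1/4
            x, y = x + dx, y + dy
    return shaded

cluster = dla(n_particles=100, radius=25)
```

Most released particles leave the circle without touching the cluster, which is why the short cuts mentioned in the text matter for large simulations.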

Running the model for, say, 10,000 shaded squares gives a highly branched 
picture (figure 18.3) that resembles the patterns in the electrolysis experiment. 
Main branches radiate from the initial point and bifurcate as they grow, giving 
rise to subsidiary side branches, all tending to grow outwards. It is natural to 
use box-counting methods to estimate the dimension of these structures on scales 
larger than a square side, and there is a remarkably close agreement between 
the electrolysis experiment and the simulation, with a value for the dimension of 
about 1.70, and 2.43 for the 3-dimensional analogue. 

The DLA model may be thought of as a representation of a succession of ions 
released from a distance one after another. Whilst this provides a good model for 
the form of the deposit, it gives little idea of its development with time, which 
depends on a large number of ions in simultaneous random motion that adhere 
to the copper deposit on meeting it. Therefore, a 'continuous' version of this 
'discrete' model is useful. Suppose that the large number of copper ions in the 
solution have density u(x, t) at point x and time t, so that the number of ions in 
a very small disc of area $\delta x$ and centre x is $u(x, t)\,\delta x$. Assuming that the ions
follow independent Brownian paths, the ions that are in this small disc at time
t will have spread out to have density at point x' and time t + h given by the
2-dimensional normal distribution

$$\delta u(x', t+h) = (2\pi)^{-1} h^{-1} \exp\left(-\frac{(x - x')^2}{2h}\right) u(x, t)\,\delta x$$

(see (16.3)) and so

$$u(x', t+h) = (2\pi)^{-1} h^{-1} \int \exp\left(-\frac{(x - x')^2}{2h}\right) u(x, t)\,dx$$

where integration is across the fluid region. This assumes that h is small relative
to the distance of x' from the deposit and the boundary, so that the effect of
the introduction or removal of ions can be neglected. Differentiating under the
integral sign with respect to x' and h gives

$$\frac{\partial u}{\partial t} = \tfrac{1}{2}\nabla^2 u \qquad (18.1)$$

as the differential equation governing the change of the ion density in the solution.
This is the well known diffusion equation or heat equation in two dimensions.

Figure 18.3 A computer realization of diffusion-limited aggregation. The square was
divided into a 700 x 700 mesh from which 16,000 squares were selected using the
method described
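The step of differentiating under the integral sign can be checked directly on the Gaussian kernel. In the sketch below, G is a name introduced here for the 2-dimensional normal density appearing in the integral; the calculation simply confirms that G itself satisfies (18.1), so that any integral of G against a fixed density does too.

```latex
% Assumption for illustration: G denotes the kernel in the integral for u(x', t+h).
G(x, h) = (2\pi)^{-1} h^{-1} \exp\left(-\frac{|x|^2}{2h}\right),
\qquad x \in \mathbb{R}^2,\ h > 0.

% Differentiate with respect to h, and twice with respect to each coordinate of x:
\frac{\partial G}{\partial h} = \left(\frac{|x|^2}{2h^2} - \frac{1}{h}\right) G,
\qquad
\nabla^2 G = \left(\frac{|x|^2}{h^2} - \frac{2}{h}\right) G,
\qquad\text{so}\qquad
\frac{\partial G}{\partial h} = \tfrac{1}{2}\nabla^2 G .
```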

We need to specify the boundary conditions for this differential equation. At the
outer boundary, say a circle of radius $r_0$, ions are supplied at a constant rate, so

$$u = u_0 \quad \text{on } |x| = r_0. \qquad (18.2)$$

Denote the boundary of the copper deposit at time t by $F_t$. Sufficiently close
to this boundary, virtually all the ions lose their charge, so

$$u = 0 \qquad (18.3)$$

on $F_t$. Since the discharged ions are deposited as metallic copper, the boundary
$F_t$ advances in the direction n normal to $F_t$, at a rate $v_n$ proportional to
the derivative of the concentration in the direction n. Thus, for a
constant k,

$$v_n = k\,\mathbf{n} \cdot \nabla u \qquad (18.4)$$

on $F_t$. (We are assuming that $F_t$ is actually smooth on a very small scale.)

Provided that the growth remains a long way from the outer electrode, the 
diffusion rate is, to a good approximation, time independent, so (18.1) may be 
replaced by Laplace's equation 

$$\nabla^2 u = 0. \qquad (18.5)$$

Solving this with boundary conditions (18.2) and (18.3) allows the rate of 
growth of the deposit to be found, using (18.4). 
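As a rough worked illustration of how these equations determine the growth rate, consider the simplest possible geometry. The assumption, made here only for the sake of example, is that the deposit is still a disc of radius r_1 (a symbol introduced for this sketch) centred at the origin, with r_1 smaller than the radius of the outer boundary.

```latex
% Assumption: the deposit is a disc of radius r_1 (hypothetical symbol), r_1 < r_0.
% The radially symmetric harmonic function with u = 0 on |x| = r_1
% and u = u_0 on |x| = r_0 is
u(x) = u_0 \, \frac{\log(|x|/r_1)}{\log(r_0/r_1)}, \qquad r_1 \le |x| \le r_0 .

% By (18.4) the boundary then advances at the uniform rate
v_n = k \left. \frac{\partial u}{\partial r} \right|_{r = r_1}
    = \frac{k\,u_0}{r_1 \log(r_0/r_1)} .
```

In this symmetric case every boundary point advances at the same rate, so the disc merely grows; branching can only arise once the symmetry is broken, which is where the irregularities discussed next come in.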

These equations alone are too idealized to provide an accurate model. First, to 
prevent the equation being unstable with respect to surface irregularities, a short 
scale 'cut-off' for the equations is required. This is provided in the square-lattice
DLA model: if a particle gets close enough, it sticks to the aggregate. Second,
our derivation of the differential equations assumed a continuously varying par- 
ticle density, rather than a large number of discrete particles. It is the random 
variation in motion of these individual particles that creates the irregularities that 
are amplified into the branching fingers. Thus (18.4) needs to be modified to 
include a random perturbation

$$v_n = k\,\mathbf{n} \cdot \nabla u + p \qquad (18.6)$$

where p may be thought of as a 'noise' term. Both of these features are present 
in the square-lattice DLA model, which is consequently more suitable for simu- 
lation of the growth form than direct numerical attempts to solve the differen- 
tial equations. 

One interpretation of the square-lattice DLA model is as providing a spatial 
solution of equations (18.2)-(18.5) subject to a small random perturbation of the
boundary $F_t$. Surprisingly, the same differential equations and boundary
conditions describe several rather different physical phenomena. The DLA model may
therefore be expected to apply to some degree in these different cases. 

The growth of viscous fingers in a fluid is an example. Suppose two glass 
plates are fixed a small distance apart (perhaps a fraction of a millimetre) and the region in between
is filled with a viscous liquid such as an oil. (This apparatus is called a Hele-Shaw 
cell.) If a low-viscosity liquid such as water is injected through a small hole in 
one of the plates, then, under certain conditions, the water spreads out into the
oil in thin highly branched fingers. The patterns resemble closely the deposits of
copper in the electrolysis experiment. 

Lubrication theory tells us that in this situation the velocity of flow v of the
oil is proportional to the pressure gradient,

$$v = -c\nabla p \qquad (18.7)$$

where p(x) is the pressure at a point x in the oil. The oil is assumed
incompressible, so the velocity has zero divergence, $\nabla \cdot v = 0$, giving

$$\nabla^2 p = 0$$

throughout the oil. If the viscosity of the water is negligible compared with that 
of the oil, then the pressure throughout the water is effectively constant. Thus 
we have the boundary conditions 

$$p(x) = p_0$$

at the fluid interface, and

$$p(x) = 0$$

at a large distance $r_0$ from the point of injection. Thus the pressure difference
$u(x) = p_0 - p(x)$ satisfies the differential equation (18.5) and boundary
conditions (18.2) and (18.3) of the electrolysis example. Furthermore, at the fluid
interface, the pressure gradient in the oil is normal to the boundary (since the
pressure is constant on the boundary), so (18.4) gives the rate of advance of the
boundary, $v_n = -k\,\mathbf{n} \cdot \nabla p$, with short-range cut-off provided by surface tension
effects. The pressure is analogous to the ion density in the electrolysis example.

It is perhaps, therefore, not surprising that under certain conditions the viscous 
fingers resemble the patterns produced by the square-lattice DLA model. Whilst 
the element of randomness inherent in the electrolysis example is lacking, irreg- 
ularities in the interface are nevertheless amplified to give the fingering effect. 

A very similar situation pertains for fluid flow through a porous medium — 
(18.7) is Darcy's law governing such flow. Fractal fingering can also occur in 
this situation. 

Electrical discharge in a gas provides a further example. The electrostatic 
potential u satisfies Laplace's equation $\nabla^2 u = 0$ away from the ionized region
of discharge. The ionized path conducts well enough to be regarded as being at 
constant potential, so u satisfies the same boundary conditions as in the viscous 
fingering example. The (questionable) assumption that the rate of breakdown is 
proportional to the electric field gives equation (18.4). This is another example 
with similar differential equations, for which the square-mesh DLA model pro- 
vides a realistic picture. 

Under suitable experimental conditions, the growth patterns in electrolysis,
viscous fingering and electrical discharge have a dimension of about 1.7 when
estimated over a suitable range of scales. This coincides with the value obtained from
computer studies of square-mesh DLA. Although the theoretical explanations of
such phenomena are not always entirely satisfactory, the universality of this 
dimension is very striking. 



18.2 Singularities of electrostatic and gravitational potentials 

The electrostatic potential at a point x due to a charge distribution $\mu$ or the
gravitational potential due to a mass distribution $\mu$ in $\mathbb{R}^3$ is given by

$$\phi(x) = \int \frac{d\mu(y)}{|x - y|}. \qquad (18.8)$$

We show that the dimension of the singularity set of the potential, i.e. the set of
x for which $\phi(x) = \infty$, cannot be too large.

Proposition 18.1 

Let $\mu$ be a mass distribution of bounded support on $\mathbb{R}^3$. Suppose that the potential
(18.8) has singularity set $F = \{x : \phi(x) = \infty\}$. Then $\dim_H F \le 1$.

Proof. Suppose s > 1. Let $x \in \mathbb{R}^3$ and write $m(r) = \mu(B(x, r))$ for r > 0. Suppose
that there are numbers a > 0, c > 0 such that $m(r) \le c r^s$ for all $0 < r \le a$. Then

$$
\begin{aligned}
\phi(x) &= \int_{|x-y| \le a} \frac{d\mu(y)}{|x-y|} + \int_{|x-y| > a} \frac{d\mu(y)}{|x-y|} \\
&\le \int_{r=0}^{a} \frac{dm(r)}{r} + \int_{|x-y| > a} \frac{d\mu(y)}{a} \\
&\le \left[ r^{-1} m(r) \right]_0^a + \int_0^a r^{-2} m(r)\,dr + a^{-1}\mu(\mathbb{R}^3) \\
&\le c\left(1 + (s-1)^{-1}\right) a^{s-1} + a^{-1}\mu(\mathbb{R}^3) < \infty,
\end{aligned}
$$

after integrating by parts. So, if $x \in F$, we must have that $\limsup_{r \to 0} \mu(B(x, r))/r^s \ge c$
for all c > 0. It follows from Proposition 4.9(b) that $\mathcal{H}^s(F) = 0$ for s > 1, as
required. □

Often $\mu$ is expressible in terms of a 'density function' f, so that
$\mu(A) = \int_A f(x)\,dx$ for Borel sets A, and (18.8) becomes

$$\phi(x) = \int \frac{f(y)}{|x - y|}\,dy. \qquad (18.9)$$

Given certain conditions on f, for example if $\int |f(x)|^p\,dx < \infty$ for some p > 1,
similar methods can be used to place further bounds on the dimension of the
singularity set; see Exercise 18.5.

It is easily verified that, if f is a sufficiently smooth function, then (18.9) is
the solution of Poisson's equation

$$\nabla^2 \phi = -4\pi f$$

satisfying $\phi(x) \to 0$ as $|x| \to \infty$. For a general integrable function f the
potential $\phi$ need not be differentiable. Nevertheless (18.9) may be regarded as a weak
solution of Poisson's equation in a sense that can be made precise using the 
theory of distributions. This technique extends, in a non-trivial way, to give 
bounds for the dimension of the singularity sets of weak solutions of other partial 
differential equations. 



18.3 Fluid dynamics and turbulence 

Despite many years of intense study, turbulence in fluids is still not fully under- 
stood. Slowly moving fluids tend to flow in a smooth unbroken manner, which 
is described accurately by the Navier-Stokes equations — the fundamental dif- 
ferential equations of fluid dynamics. Such smooth flow is termed laminar. At 
higher speeds, the flow often becomes turbulent, with the fluid particles following 
convoluted paths of rapidly varying velocity with eddies and irregularities at all 
scales. Readers will no doubt be familiar with the change from laminar to turbu- 
lent flow as a tap is turned from low to full. Although the exact form of turbulent 
flow is irregular and unpredictable, its overall features are consistently present. 

There is no uniformly accepted definition of turbulent flow — this has the 
advantage that it can reasonably be identified with any convenient 'singular fea- 
ture' of a flow description. We consider a model in which turbulence is manifested 
by a significant local generation of heat due to viscosity, i.e. 'fluid friction', at 
points of intense activity. 

At reasonably small scales, turbulence may be regarded as isotropic, i.e. direc- 
tion independent. Our intuitive understanding of isotropic turbulence stems largely 
from the qualitative approach of Kolmogorov rather than from an analysis of
differential equations. Kolmogorov's model is based on the idea that kinetic energy
is introduced into a fluid on a large scale, such as by stirring. However, kinetic 
energy can only be dissipated (in the form of heat) on very small scales where the 
effect of viscosity becomes important. At intermediate scales dissipation can be 
neglected. If there are circulating eddies on all scales, then energy is transferred 
by the motion of the fluid through a sequence of eddies of decreasing size, until it 
reaches the small eddies at which dissipation occurs. If, as Kolmogorov assumed, 
the fluid region is filled by eddies of all scales, then dissipation of energy as heat 
should occur uniformly throughout the fluid. 

Let $\varepsilon(x)$ be the rate of dissipation per unit volume at the point x, so that the
heat generated in a small volume $\delta V$ around x in time $\delta t$ is $\varepsilon(x)\,\delta V\,\delta t$. Then, on
the assumption of uniform dissipation,

$$\varepsilon(x) = \bar{\varepsilon} \quad \text{for all } x \text{ in } D$$

where $\bar{\varepsilon}$ is the rate of input of energy into the fluid region D, which is assumed
to have unit volume.

Although such 'homogeneous' turbulence is appealing in its simplicity, it 
is not supported by experimental observations. Measurements using a hot-wire 
anemometer show that in a turbulent fluid the rate of dissipation differs greatly in 
different parts of the fluid. This is the phenomenon of intermittency. Dissipation 
is high in some regions and very low in others, whereas the Kolmogorov model 
requires it to be constant. This variation can be quantified using correlation func- 
tions. For a small vector h the correlation of dissipation rates between points 
distance h apart is given by 

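$$\langle \varepsilon(x)\varepsilon(x + h) \rangle \qquad (18.10)$$

where angle brackets denote the average over all x in D. If dissipation were
constant we would have $\langle \varepsilon(x)\varepsilon(x + h) \rangle = \bar{\varepsilon}^2$. However, experiment indicates that

$$\langle \varepsilon(x)\varepsilon(x + h) \rangle \sim \bar{\varepsilon}^2 |h|^{-d} \qquad (18.11)$$

for a value of d between 0.4 and 0.5.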

The Kolmogorov model can be modified to explain the intermittency by assum- 
ing that, instead of the eddies at each scale filling space, the eddies fill a 
successively smaller proportion of space as their size decreases. Kinetic energy 
is introduced into the largest eddy and passed through eddies of decreasing size 
until it is dissipated at the smallest scale. Now, however, the energy and dissi- 
pation are concentrated on a small part of the fluid. The cascade of eddies may 
be visualized as the first k stages $E_i$ of the construction of a self-similar fractal
F (see Chapter 9) where k is quite large, with dissipation occurring across the
kth stage $E_k$. For convenience, we assume that each ith level basic set of $E_i$ is
replaced by a constant number of sets of equal size to form $E_{i+1}$.

If A is a subset of D, we define $\mu(A) = \int_A \varepsilon(x)\,dx$ as the total rate of
dissipation of energy in the set A; thus $\mu(D) = \bar{\varepsilon}$, the rate of energy input. Then
$\mu$ has the properties of a mass distribution on D. Moreover, if we assume that
the rate of dissipation in each component of $E_i$ is divided equally between the
equal-sized subcomponents in $E_{i+1}$, we have, as a simple consequence of F
being self-similar of Hausdorff or box dimension s, that

$$c_1 \bar{\varepsilon} r^s \le \mu(B(x, r)) \le c_2 \bar{\varepsilon} r^s$$

if $x \in F$ and $0 < r < r_0$, where $c_1$ and $c_2$ are positive constants (see Exercise
9.11). These inequalities hold for the limit F as the size of the dissipation eddies
tends to 0, but also for the physical approximation $E_k$, provided that r is larger
than the dissipation scale. Then

$$\int_{|h| \le r} \langle \varepsilon(x)\varepsilon(x + h) \rangle\,dh = \int_{x \in D} \int_{|h| \le r} \varepsilon(x)\varepsilon(x + h)\,dh\,dx = \int_{x \in E_k} \varepsilon(x)\,\mu(B(x, r))\,dx$$

since dissipation is concentrated on $E_k$, so

$$c_1 \bar{\varepsilon}^2 r^s \le \int_{|h| \le r} \langle \varepsilon(x)\varepsilon(x + h) \rangle\,dh \le c_2 \bar{\varepsilon}^2 r^s. \qquad (18.12)$$

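This may be achieved if the correlation satisfies a power law

$$\langle \varepsilon(x)\varepsilon(x + h) \rangle \sim \bar{\varepsilon}^2 |h|^{s-3}$$

for then the integral in (18.12) becomes

$$4\pi \int_{t=0}^{r} \bar{\varepsilon}^2 t^{s-3}\, t^2\,dt = 4\pi \bar{\varepsilon}^2 r^s / s.$$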

Comparison with (18.11) suggests that $s = 3 - d$, so the hypothesis of 'fractally
homogeneous turbulence', that dissipation is concentrated on an approximate 
fractal of dimension between 2.5 and 2.6, is consistent with experimental results. 
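The arithmetic linking the cascade to the measured exponent can be made concrete with a small sketch. The specific cascade below, in which every cube is cut into 8 subcubes of half the side and a fixed number of them carry the dissipation, is an illustrative assumption and not a model taken from the text; the function names are likewise hypothetical.

```python
import math

def cascade_dimension(kept, ratio=2, space_dim=3):
    """Similarity dimension s when each cube of a stage is divided into
    ratio**space_dim subcubes, of which `kept` carry the dissipation."""
    if not 1 <= kept <= ratio ** space_dim:
        raise ValueError("kept must lie between 1 and ratio**space_dim")
    return math.log(kept) / math.log(ratio)

def correlation_exponent(s, space_dim=3):
    """Exponent d in the power law |h|^(-d), using the relation s = 3 - d."""
    return space_dim - s

# Compare a space-filling cascade (kept = 8) with sparser ones.
for kept in (8, 7, 6, 5):
    s = cascade_dimension(kept)
    print(kept, round(s, 3), round(correlation_exponent(s), 3))
```

Retaining 6 of the 8 subcubes gives s of about 2.585 and hence d of about 0.415, inside the experimental range quoted for d; retaining all 8 recovers the homogeneous Kolmogorov picture with d = 0.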

It is natural to seek theoretical reasons for the turbulent region to have a 
fractal form. One possible explanation is in terms of the vortex tubes in the fluid. 
According to Kelvin's circulation theorem, such tubes are preserved throughout 
the motion, at least in the approximation of inviscid flow. However, the vortex 
tubes are stretched by the fluid motion, and become long and thin. Repeated 
folding is necessary to accommodate this length, so the tubes might assume an 
approximate fractal form not unlike the horseshoe example in figure 13.5. 

The behaviour of a (viscous) fluid should be predicted by the Navier-Stokes
equation

$$\frac{\partial u}{\partial t} + (u \cdot \nabla)u - \nu \nabla^2 u + \nabla p = f \qquad (18.13)$$

where u is the velocity, p is the pressure, $\nu$ is viscosity and f is the applied
force density. Deducing the existence of fractal regions of activity from the
Navier-Stokes equation is far from easy. Nevertheless, the method indicated in 
Section 18.2 may be generalized beyond recognition to demonstrate rigorously 
that, for example, the set on which a solution u(x, t) of (18.13) fails to be bounded 
for all t has dimension at most 2½. Thus it is possible to show from the equations
of fluid flow that certain types of 'intense activity' must be concentrated on sets 
of small dimension. 



18.4 Fractal antennas 

A remarkable application of fractals is their use as antennas in high frequency 
radio and in particular in mobile telephones. There are two advantages of using 
fractals. First, the 'space-filling' nature of some fractals, such as variations on
the von Koch curve, allows a high response fractal antenna to be fitted into a
relatively small space. Secondly, depending on their geometry, antennas can be
multiband with resonant frequencies reflecting the self-similarities of the fractal,
or alternatively can have frequency independent response, for example in the
case of certain random fractal antennas.

Electrodynamics, and in particular the behaviour of radio waves, is governed 
by Maxwell's equations. Assume a time dependence of the electromagnetic
quantities proportional to $e^{i\omega t}$, where $\omega$ is the frequency, so that the electric field at
the point x is $e^{i\omega t}E(x)$ and the magnetic field is $e^{i\omega t}H(x)$. Then Maxwell's 'curl'
equations in a vacuum reduce to

$$\nabla \times E = -i\omega\mu H$$
$$\nabla \times H = i\omega\varepsilon E,$$

where $\varepsilon$ is the permittivity and $\mu$ the permeability of space. It is immediate that
these equations are invariant under the pair of transformations

$$x \mapsto kx \quad \text{and} \quad \omega \mapsto \omega/k,$$

for every (positive or negative) scalar k, with both sides of the equations
multiplied by 1/k under the transformations.
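The invariance can be written out in one line. In the sketch below the primed quantities are notation introduced here for the rescaled variables and fields; they do not appear in the text.

```latex
% Assumed notation: x' = kx, \omega' = \omega/k, E'(x') = E(x'/k), H'(x') = H(x'/k).
% Derivatives with respect to x' carry a factor 1/k, so
\nabla' \times E'(x') = \frac{1}{k}\,(\nabla \times E)(x)
                      = \frac{1}{k}\,\bigl(-i\omega\mu H(x)\bigr)
                      = -i\omega'\mu H'(x'),
\qquad
\nabla' \times H'(x') = \frac{1}{k}\,\bigl(i\omega\varepsilon E(x)\bigr)
                      = i\omega'\varepsilon E'(x') .
```

Both sides of each equation acquire the same factor 1/k, so the rescaled fields solve the same equations at the rescaled frequency.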

If the antenna has a similar form when scaled by a factor k, the boundary 
conditions for Maxwell's equations are similar under this scaling, so the radiation 
properties might be expected to behave in a similar manner when the frequency 
is scaled by 1/k, that is when the wavelength is scaled by k.

This is borne out in the Sierpiński dipole, a practical instance of a fractal
antenna; see figure 18.4. In one realization, two Sierpiński triangles are etched on
a printed circuit board, vertex to vertex (a fractal 'bow-tie'), with the vertices fed
by a two-wire transmission line from a transmitter or receiver. The fundamental
frequency of the antenna corresponds to a wavelength comparable to the overall
diameter of the Sierpiński triangle. Sharp peaks are observed in the antenna
response at 1, 1/2, 1/4, 1/8 and 1/16 times this wavelength, these ratios corresponding to
the fundamental similarity ratios of the triangle, bearing out the above argument.
At each of these resonances the current density is concentrated on the pair of
similar copies of the Sierpiński triangle nearest to the feed point and at the
appropriate scale. Varying the construction parameters of the Sierpiński triangle
leads to antennas with other sequences of bands.

Figure 18.4 Fractal antennas, showing feed lines: (a) Sierpiński dipole, (b) random tree
antenna, (c) von Koch dipole

Random fractals can be used to construct antennas suitable for a continuous 
range of frequencies. For example, tree antennas, which are statistically self-similar
at all scales (such as those generated by electrochemical deposition of the form 
of figure 18.3), give a relatively constant response across a range of frequen- 
cies. These compare well with traditional antennas designed for a continuous 
frequency band. 



18.5 Fractals in finance 

Examination of graphs of share prices, exchange rates, etc., suggests some sort 
of self-affine scaling, that is, for a number $0 < \alpha < 1$, if X(t) is the share price
at time t then the increment $X(\gamma t) - X(\gamma t_0)$ has similar overall form to the
increment $\gamma^\alpha (X(t) - X(t_0))$ for a range of $t > t_0$ and $\gamma > 0$. This suggests that
a statistically self-affine process or even a deterministic self-affine function might 
be appropriate for modelling prices. Moreover, the self-affinity assumption has 
many consequences for features of the process modelled. 

Because of its central place in stochastic processes, Brownian motion (see 
Section 16.1) is the most natural statistically self-affine model to try. Together 
with its variants, it is the basis for many financial models. 

Let X be Brownian motion on [0, ∞); we might think of a realization of X(t)
as the price, or perhaps the logarithm of the price, of some stock at time t. A 
crucial feature of Brownian motion is the martingale or independent increment 
property: given the process up to time t, the expected value of X(t + h) is just 
X(t), for all h > 0, reflecting that the future value of the stock is unpredictable, 
with a profit as likely as a loss. 

If we now take X to be index-$\alpha$ fractional Brownian motion (Section 16.2),
the increments are no longer independent, but are correlated, i.e.
$E((X(t) - X(0))(X(t + h) - X(t))) > 0$ or $< 0$, according to whether $\alpha > \tfrac{1}{2}$ or $\alpha < \tfrac{1}{2}$.
Moreover, if $\alpha > \tfrac{1}{2}$ the process has long range dependence, that is, for fixed
small h, the covariances of the increments

$$E((X(h) - X(0))(X(t + h) - X(t))) = \tfrac{1}{2}\left[(t + h)^{2\alpha} + (t - h)^{2\alpha} - 2t^{2\alpha}\right] \sim \alpha(2\alpha - 1)\,t^{2\alpha - 2} h^2$$

die away slowly as t increases, in the sense that
$\sum_{k=1}^{\infty} E((X(h) - X(0))(X(k + h) - X(k))) \sim \sum_{k=1}^{\infty} k^{2\alpha - 2}$ diverges. An interpretation might be that if, for some
$\alpha > \tfrac{1}{2}$, index-$\alpha$ fractional Brownian motion provides a reasonable model of prices
then we may make inference on the future behaviour of share prices by studying
past variations.
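The sign change at index 1/2 and the slow decay of the covariances can be checked numerically. The sketch below assumes the usual normalization of fractional Brownian motion, in which the variance of an increment over a time interval of length |t - u| is |t - u| raised to the power 2 alpha; the function names are illustrative.

```python
def increment_covariance(t, h, alpha):
    """Covariance of X(h) - X(0) and X(t + h) - X(t) for index-alpha
    fractional Brownian motion, valid for t >= h > 0."""
    p = 2.0 * alpha
    return 0.5 * ((t + h) ** p + (t - h) ** p - 2.0 * t ** p)

def asymptotic_covariance(t, h, alpha):
    """Leading term alpha (2 alpha - 1) t^(2 alpha - 2) h^2 when t is much larger than h."""
    return alpha * (2.0 * alpha - 1.0) * t ** (2.0 * alpha - 2.0) * h ** 2

def partial_sum(alpha, h, terms):
    """Sum of the covariances with the increments starting at k = 1, ..., terms."""
    return sum(increment_covariance(k, h, alpha) for k in range(1, terms + 1))

# Positive correlation above index 1/2, negative below, none at exactly 1/2.
for alpha in (0.3, 0.5, 0.7):
    print(alpha, increment_covariance(100.0, 0.1, alpha))

# For alpha > 1/2 the partial sums keep growing as more terms are included.
print(partial_sum(0.7, 0.5, 100), partial_sum(0.7, 0.5, 10000))
```

For alpha below 1/2 the corresponding partial sums settle down, which is the numerical counterpart of the absence of long range dependence.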

Many further variants are possible. For example, taking cX(t) + at, where X is
index-$\alpha$ fractional Brownian motion and a and c are positive constants, adds a
'drift' to the process, with an underlying upward trend of rate a. Given financial
data over a range of t, statistical methods may be used to estimate appropriate
values of $\alpha$, a and c.

At first sight, realizations of Brownian motion appear qualitatively similar to 
share prices or exchange rates, but a more careful examination shows marked 
differences, see figure 18.5(a) and (c). Brownian motion does not exhibit the 
sudden jumps and periods of intense activity, or 'volatility', observed in prices. 
This is exemplified by considering increments: for fixed small h > 0, the incre- 
ment X(t + h) — X(t) regarded as a function of t appears as a 'noise' of fairly 
constant amplitude if X is Brownian motion, figure 18.6(a), whereas for stock 
prices it has very marked sharp peaks and dips rising above the ambient noise 
level, corresponding to price jumps, figure 18.6(c). 

One way this might arise is if X has a multifractal structure as a function, 
that is with a relatively small set of times t where the local fluctuations are 
exceptionally large. This may be made precise in terms of the dimensions of 
the sets

$$E_\alpha = \left\{ t : \liminf_{h \to 0} \frac{\log |X(t + h) - X(t)|}{\log |h|} = \alpha \right\};$$

thus, $E_\alpha$ comprises those t at which the Hölder exponent of X is $\alpha$, an analogue
of (17.16) for functions. There are several ways of modelling these more extreme
fluctuations. A self-affine curve (Section 11.1) with a 'generator' that has several
different gradients has fluctuating Hölder exponents and can yield graphs with
many of the features of share prices. 
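
In practice the limit defining the Hölder exponent cannot be taken, and one might instead fit a slope over a range of small scales. The Python sketch below does this by least squares, regressing $\log|X(t+h) - X(t)|$ on $\log|h|$. The function name, the choice of scales and the regression itself are our own assumptions, and for noisy data such an estimate is crude.

```python
import math
from typing import Callable, Sequence


def holder_exponent_estimate(x: Callable[[float], float], t: float,
                             hs: Sequence[float]) -> float:
    """Least-squares slope of log|x(t+h) - x(t)| against log|h| over the scales hs."""
    pts = []
    for h in hs:
        d = abs(x(t + h) - x(t))
        if h != 0.0 and d > 0.0:  # skip scales where the logarithm is undefined
            pts.append((math.log(abs(h)), math.log(d)))
    n = len(pts)
    if n < 2:
        raise ValueError("need at least two usable scales")
    mx = sum(p[0] for p in pts) / n
    my = sum(p[1] for p in pts) / n
    sxx = sum((p[0] - mx) ** 2 for p in pts)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in pts)
    return sxy / sxx


if __name__ == "__main__":
    scales = [2.0 ** -k for k in range(4, 21)]
    # |t|^0.3 has Holder exponent 0.3 at t = 0
    print(holder_exponent_estimate(lambda t: abs(t) ** 0.3, 0.0, scales))
```

For a function with a sharp cusp such as $|t|^{0.3}$ at $t = 0$ the fitted slope recovers the exponent, whereas at a point where the function is differentiable with non-zero derivative the slope is close to 1.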

A more sophisticated approach uses multifractal time. The underlying idea is 
that prices follow Brownian or fractional Brownian motion, but with respect to 
rescaled or 'warped' time called trading time $T(t)$. Thus, when the market is 
volatile a short period in real time $t$ is stretched to a long period of trading time 
$T$, whilst at inactive periods, a long $t$ period corresponds to a short $T$ period, 
figures 18.5(b) and 18.6(b). 

Figure 18.5 (a) Brownian motion, (b) the same realization of Brownian motion with 
respect to 'trading time' $T$, given by a multifractal 'warp' of real time $t$, (c) daily 
dollar/pound exchange rates over 11 years. The combination of periods of low volatility 
interspersed with periods of high activity is more reminiscent of (b) than of (a). 

Figure 18.6 (a) Increments of Brownian motion; (b) increments of Brownian motion 
'warped' by multifractal time; (c) real data: increments of a stock price over a period of 
5½ years. The character of (c) is closer to (b) than to (a). 

Let $\mu$ be a measure on $\mathbb{R}^+$, typically a multifractal measure. For $t \geq 0$ we 
define the trading time $T(t) = \mu[0, t] = \int_0^t d\mu$. Writing $B^\alpha$ for index-$\alpha$ 
fractional Brownian motion, we set $X(t) = B^\alpha(T(t))$. Given $T$, $X(t)$ is a zero mean 
Gaussian process with variance of increments given by 

$$E(X(t+h) - X(t))^2 = E(B^\alpha(T(t+h)) - B^\alpha(T(t)))^2 = |T(t+h) - T(t)|^{2\alpha} = \mu[t, t+h]^{2\alpha} \sim |h|^{2\gamma\alpha} \qquad (18.14)$$

for small $h$ and $t \in F_\gamma$, where $F_\gamma$ is the set of times where $\mu$ has local 
dimension $\gamma$. Thus, the volatility of $X$ at $t$ depends on the local dimension of $\mu$ 
at $t$, and so the distribution of times of high and low activity in the share prices 
depends on the multifractal structure of $\mu$. If we develop the model to make $\mu$ a 
random multifractal measure, the trading time $T$ becomes a random process, so that 
the times of high and low activity are random. 
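
A minimal sketch of this construction follows, under simplifying assumptions of our own: $\mu$ is taken to be the deterministic binomial measure on $[0, 1]$, with weights $p$ and $1 - p$ repeatedly splitting dyadic intervals, and $\alpha = \frac{1}{2}$, so that $B^\alpha$ is ordinary Brownian motion with independent increments. The function names are hypothetical.

```python
import math
import random
from typing import List


def binomial_cascade(levels: int, p: float) -> List[float]:
    """Masses of the 2**levels dyadic subintervals of [0, 1] under the binomial measure."""
    masses = [1.0]
    for _ in range(levels):
        # each interval passes a proportion p of its mass to its left half
        masses = [m * w for m in masses for w in (p, 1.0 - p)]
    return masses


def trading_time(masses: List[float]) -> List[float]:
    """T(t_j) = mu[0, t_j] at the grid points t_j = j / len(masses)."""
    T = [0.0]
    for m in masses:
        T.append(T[-1] + m)
    return T


def warped_brownian_motion(T: List[float], rng: random.Random) -> List[float]:
    """X(t_j) = B(T(t_j)): independent Gaussian increments with variance T_{j+1} - T_j."""
    x = [0.0]
    for t0, t1 in zip(T, T[1:]):
        x.append(x[-1] + rng.gauss(0.0, math.sqrt(t1 - t0)))
    return x


if __name__ == "__main__":
    masses = binomial_cascade(levels=12, p=0.3)
    T = trading_time(masses)
    X = warped_brownian_motion(T, random.Random(0))
    increments = [b - a for a, b in zip(X, X[1:])]
    print(max(abs(d) for d in increments), sum(abs(d) for d in increments) / len(increments))
```

The increments of $X$ are large where $\mu$ concentrates mass and small elsewhere, giving bursts of activity of the kind described above. Choosing the weights at random at each subdivision would give a random trading time.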

Statistical methods may be applied to real price data to estimate $\alpha$ and the 
multifractal spectrum required for $\mu$, which may be related to the 'multifractal 
spectrum of $X$' given by $\dim_H E_\alpha$. 
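
One such statistical method is the method of moments, in which the power-law growth of moment sums $\sum \mu(I)^q$ over mesh intervals $I$ of side $r$ is estimated by regression. The Python sketch below applies it to a binomial measure, for which the exponent is known exactly. Writing the exponent as $\beta(q)$, with the convention $\sum \mu(I)^q \sim r^{-\beta(q)}$, is our notational assumption here, as are the function names.

```python
import math
from typing import List


def binomial_masses(levels: int, p: float) -> List[float]:
    """Masses of the 2**levels dyadic subintervals of [0, 1] under the binomial measure."""
    masses = [1.0]
    for _ in range(levels):
        masses = [m * w for m in masses for w in (p, 1.0 - p)]
    return masses


def coarsen(masses: List[float]) -> List[float]:
    """Merge adjacent pairs, giving the masses of the dyadic intervals one level coarser."""
    return [masses[i] + masses[i + 1] for i in range(0, len(masses), 2)]


def moment_exponent(masses: List[float], q: float) -> float:
    """Slope of log(sum mu(I)^q) against -log r over the dyadic scales r = 2^-n, n >= 1."""
    pts = []
    m = list(masses)
    while len(m) >= 2:
        r = 1.0 / len(m)
        s = sum(v ** q for v in m if v > 0.0)  # intervals of zero mass are ignored
        pts.append((-math.log(r), math.log(s)))
        m = coarsen(m)
    n = len(pts)
    mx = sum(a for a, _ in pts) / n
    my = sum(b for _, b in pts) / n
    sxx = sum((a - mx) ** 2 for a, _ in pts)
    sxy = sum((a - mx) * (b - my) for a, b in pts)
    return sxy / sxx


if __name__ == "__main__":
    p = 0.3
    masses = binomial_masses(12, p)
    for q in (-1.0, 0.0, 1.0, 2.0):
        exact = math.log2(p ** q + (1.0 - p) ** q)
        print(q, moment_exponent(masses, q), exact)
```

For the binomial measure the moment sum at level $n$ is exactly $(p^q + (1-p)^q)^n$, so the fitted slope should agree with $\log_2(p^q + (1-p)^q)$. With price data the masses would be replaced by quantities estimated from the observations, and the fit would no longer be exact.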

Fractality is a crucial feature in mathematical modelling of finance. Here we 
have just indicated some simple models; more sophisticated techniques include 
stochastic differential equations, where the fractal nature of the process is a 
consequence of the modelling assumptions. 



18.6 Notes and references 

The literature on applications of fractals is enormous. A wide variety of applications 
are described in the books by Mandelbrot (1982), Feder (1988), Hastings and Sugi- 
hara (1993), Crilly, Earnshaw and Jones (1993), Gouyet (1996) and Meakin (1998). 
General conference proceedings include those edited by Shlesinger, Mandelbrot 
and Rubin (1984), Pietronero and Tosatti (1986), Pietronero (1989), Aharony and 
Feder (1990), Fleischmann, Tildesley and Ball (1990), Takayasu (1990), Cherbit 
(1991) and Novak (1998, 2000). 

For fractal growth and DLA, refer to Stanley and Ostrowsky (1986, 1988), 
Feder (1988) and Vicsek (1989). 

For an introduction to the ideas of turbulence, see Leslie (1973). The homoge- 
neous model of Kolmogorov (1941) was adapted to the fractally homogeneous 
model by Mandelbrot (1974); see also Frisch, Sulem and Nelkin (1978). Collec- 
tions of papers relevant to fractal aspects of turbulence include Temam (1976) and 
Barenblatt, Iooss and Joseph (1983). The books by Temam (1983, 1997) discuss 
the dimension of sets related to solutions of the Navier-Stokes equation. 

For discussion of fractal antennas, see Puente, Romeu, Pous and Cardama 
(1997), Jaggard (1997) and Hohlfeld and Cohen (1999). 

Fractals in finance are discussed in the book by Peters (1994) and in Man- 
delbrot's (1997) 'Selecta' and the many references therein; multifractal time is 
presented in Mandelbrot (1997) and Gonçalves and Riedi (1999). 

There are many books on other applications. For geophysics see Scholz and 
Mandelbrot (1989), Xie (1993) and Turcotte (1997), and for fractal river basins 
see Rodríguez-Iturbe and Rinaldo (1997). Applications to chemistry may be found 
in Avnir (1989), to astronomy in Heck and Perdang (1991), to molecular 
biophysics in Dewey (1997), to physiology and medicine in West (1990) and to 
engineering in Lévy Véhel, Lutton and Tricot (1997) and Dekking, Lévy Véhel, 
Lutton and Tricot (1999). Surface growth is discussed by Barabási and 
Stanley (1995), and multifractal aspects of rainfall are considered by Lovejoy and 
Schertzer (1995). 

Exercises 

18.1 Suppose that the DLA square-lattice model is run for a large number of very small 
squares. Suppose that the set obtained is an approximate fractal of dimension $s$. 
What power law would you expect the number of shaded squares within distance $r$ 
of the initial square to obey? Assuming that during the process squares tend to be 
added to parts of the set further away from the initial square, how would you expect 
the 'radius' of the growth after $k$ squares have been added to depend on $k$? 

18.2 Let $m(t)$ be the mass of copper that has been deposited and $r(t)$ be the 'radius' of the 
copper deposit after time $t$ in the electrolysis experiment described in Section 18.1. 
It may be shown that the current flowing, and thus, by Faraday's law, the rate of 
mass deposition, is proportional to $r(t)$. On the assumption that the growth forms 
an approximate fractal of dimension $s$, so that $m(t) \sim c\,r(t)^s$, give an argument to 
suggest that $r(t) \sim c_1 t^{1/(s-1)}$. 

18.3 Verify that u(x, t) in the DLA model satisfies the partial differential equation (18.1). 

18.4 Verify that the potential in (18.9) satisfies Poisson's equation if, say, $f$ is a twice 
continuously differentiable function with $f(x) = 0$ for all sufficiently large $x$. 

18.5 Show that, if $f(x) = 0$ for all sufficiently large $x$ and $\int |f(x)|^2 dx < \infty$, then the 
singularity set of $\phi$, given by (18.9), is empty. 

18.6 Show that the argument leading to (18.12) can be adapted to the case when, say, $D$ 
is the unit cube in $\mathbb{R}^3$ and $F$ is the product of the Cantor dust of figure 0.4 and a 
unit line segment $L$. (Dissipation is assumed to occur on the set $E_k \times L$, where $E_k$ 
is the $k$th stage in the construction of the Cantor dust for some large $k$.) 

18.7 For a fractal antenna made from a wire in the shape of a von Koch curve, if the 
fundamental frequency is $\omega$, what are the higher resonant frequencies? 

18.8 Generalize (18.14) to show that $E(|X(t+h) - X(t)|^q) \sim |h|^{q\gamma\alpha}$ for $t \in F_\gamma$ (see 
Exercise 16.7). 

problems 48-49 

properties 47-48 
Brownian motion 37, 258-267, 259, 
261, 298, 299, 311-315 

Fourier series representation 260 

fractional 173, 267, 267-271, 269, 
275 

index-α fractional 267, 267-271, 269, 
multifractional 271 
graph 265-266 
calculation of Hausdorff dimension 34 
'Cantor product' 99, 100, 102, 103 
as attractor 189, 192, 194 
non-linear 136-137, 154 
capacity dimension 41 
cat map 213 
central limit theorem 24, 259 
chaotic attractor 189, 192 
class $\mathcal{C}^s$ 113, 113-118, 157 

member 113 
closed ball 4, 66 
closed interval 4 
closed loop, as attractor 201 
closed set 5, 5, 114, 201 
closure of set 6 
cloud boundaries 244, 298 
coarse multifractal spectrum 278-279 

lower 280 

upper 280 
coastline(s) 54, 244, 298 
codomain 7 
collage theorem 145 
compact set 6, 48 
complement of a set 4 
complete metric 125 
complex dynamics 216 
composition of functions 7 
computer drawings 127-128, 128, 
145-148, 196, 227, 232, 233, 234, 
239, 240, 244, 260, 269, 270-271, 
275 
conditional expectation 22 
conditional probability 19 
conformal mapping 138-139, 180, 241 
direct 8 
Minkowski 45 

s -dimensional 45 
continued fraction(s) 153-154 

examples 153 
continued fraction expansion 153 
continuity equation 203 
continuous dynamical systems 201-205 
continuous function 10 
continuously differentiable function 10 
contours 275 
contracting similarity 123 
contraction 123 

contraction mapping theorem 124, 125 
convergence 8 

pointwise 10 

of sequence 5 

uniform 10, 17 
convex function 181, 181-182, 287 
convex surface 182 
convolution theorem 73, 172 
copper sulphate, electrolysis of 300-301, 302-304 
correlation 169-173, 268, 311 
countable set 4, 32, 41 
countable stability 

of dimension print 55 

of Hausdorff dimension 32, 41 
counting measure 13 
covariance matrix 270 
cover of a set 27, 28, 30, 35-36 

see also (δ-)cover 
covering lemma 66-67 
critical exponent 54 
critical point 227, 228 
cross section 202 
cube 4, 42 
cubic polynomial, Newton's method for 

239-241, 240 
curve 53, 81 

fractal 133-134 

Jordan 53, 81 

rectifiable 81, 81-84, 85-86, 
181-182 
curve-free set 81, 83, 83-84 
curve-like set 81, 82-83, 83-84 

Darcy's law 305 

data compression 145-148 

decomposition of 1-sets 80-81, 81 

(delta-)/(δ-)cover 27, 28, 30, 36 

(delta-)/(δ-)neighbourhood 4, 4, 45, 112, 124 
(delta-)/(δ-)parallel body 4 
dendrite 232 
dense set 6 
density 76, 76-80, 84, 88, 152 

lower 77, 84 

upper 77 
density function 306 
derivative 10 
diameter of subset 5, 27 
difference of sets 4 
differentiability 10, 137-138, 160, 182 

continuous 10 
diffusion equation 303 
diffusion limited aggregation (DLA) 

model 301-306, 302, 303 
digital sundial 96-97, 97 
dimension xxii-xxv, xxiv 

alternative definitions 39-58 

approximations to 299, 305 

of attractors and repellers 186-201 

box(-counting) 41, 41-50, 43, 44, 60, 
263, 265, 268, 272, 273, 299-300 



calculation of 34-35, 59-75, 
128-135 

of Cantor set xxiii 
characterization of quasi-circles by 235-237 

divider 39, 53-54 

entropy 41 

experimental estimation of 39-40, 
299-300 

finer definitions 36-37 

Fourier 74 

of graphs of functions 160-169, 161 

Hausdorff xxiv, 27, 31, 31-33, 
35-36, 40, 54, 70, 92 

Hausdorff-Besicovitch 31 

Hausdorff dimension of a measure 
209, 288 

information 41 

of intersections 109-118 

local 283 

lower box(-counting) 41, 43 

of a measure 288 

metric 41 

Minkowski(-Bouligand) 46 

modified box-counting 49, 49-50 

one-sided 54 

packing 50-53, 51 

of products 99-107 

of projections 90-97 

of random sets 246-251, 259-275 

of self-affine sets 106-107, 139-144, 
166-169 

of self-similar sets xxiv, 128-135 

similarity xxiv 

topological xxv 

upper box(-counting) 41, 43 

of von Koch curve xxiii 

of (α-)well approximable number 155 
dimension function 37 
dimension print 54-57, 55, 56 

disadvantages 55-56 

examples 56 
dimensionally homogeneous set 50 
Diophantine approximation 153, 

154-157, 205 
direct congruence 8 
Dirichlet's theorem 154 
discrete dynamical system 186, 186-201 
disjoint binary intervals 36 
disjoint collection of sets 4 
distance set 183 
distribution 

Gaussian 23 

multidimensional normal 262 

normal 23, 260 

uniform 23 
distribution of digits 151-153 
divider dimension 39, 53-54 
domain 7 
double sector 85 
duality method 176-179, 177 
'dust-like' set 110 
dynamical systems 186-212 

continuous 201-205 

discrete 186, 186-201 
dynamics, complex 216 

eddies 308 

Egoroff's theorem 17, 79 

electrical discharge in gas 305 

electrodynamics 310 

electrolysis 300-301, 301, 302-304 

electrostatic potential 70, 305, 306-307 

(s-)energy 70 

entropy 209-211, 210 

entropy dimension 41 

ergodic theory 128, 209 

Euclidean distance or metric 3 

Euclidean space 3 

event 18 

independence of 20 
event space 18 

exchange rate variation 311,37.? 
expectation 21 

conditional 22 
expectation equation 247, 250 
experiment (probabilistic) 18 
experimental approach to fractals 39-40, 

299-300 
exterior, of loop 223 
extinction probability 250 

Fatou set 216, 235 

Feigenbaum constant 193 

figure of eight 223, 228 

filled-in Julia set 215 

finance 186, 311-315 

fine (Hausdorff) multifractal spectrum 

284 
fine multifractal analysis 277, 283-286 
first return map 202, 202 
fixed point 125, 186, 216 
fluid dynamics 277, 307-309 
forked lightning 298, 300 



Fourier dimension 74 
Fourier series 206 
Fourier transform 73, 92 

methods using 73-74, 112, 171-173, 
260, 271 
fractal, definition xxii, xxv 
fractal curve 133-134,2X6 
fractal growth 300-306 
fractal interpolation 169, 170 
fractal percolation 251-255 
fractally homogeneous turbulence 309 
fractional Brownian motion 173, 267, 

269, 267-271 
fractional Brownian surface(s) 273-275 
fractions, continued 153-154 
Frostman's lemma 70 
full square 254 
function(s) 6, 6-7 

continuous 10 

convex 181, 181-182 
functional analysis 179 

gauge function 37 
Gaussian distribution 23 
Gaussian process 267 
general construction 61-62, 62 
generator 134-135 

examples 132, 133, 134 
geometric invariance 41 
geometric measure theory 53, 76 
graphs of functions 160, 160-169, 258, 

266, 267 
gravitational potential 70, 306-307 
group(s) of fractional dimension 

182-184 
group of transformations 8, 110, 111 
growth 300-306 

Hamiltonian 207-208 
Hamiltonian systems, stability of 

207-208, 212 
Hamilton's equations 207 
Hausdorff-Besicovitch dimension 31 
Hausdorff dimension xxiv, 27, 31, 31-33, 
31, 54 
of attractor 192, 193 
and box(-counting) dimension 46, 60 
Brownian motion 263, 265 
calculating 70-72, 92 
equivalent definitions 35-36 
fractional Brownian motion 268 
fractional Brownian surface 273 
Lévy stable process 272 
of a measure 209, 288 

and packing dimension 53 

and projections 90-93 

properties 32 

of self-affine set 140-144, 140, 144 
Hausdorff distance 124, 124 
Hausdorff measure 14, 27-30, 28 

and intersections 113 

and packing measure 53 

and product rule 99 

and quasi-circle 236 
Hausdorff metric 124, 124, 145, 300 
heat equation 303 
Hele-Shaw cell 304 
Hénon attractor/map 103, 196-197, 197, 

198, 212, 213 
heuristic calculation(s) 34-35, 129 
histogram method, for multifractal 

spectrum 279, 283 
Hölder condition 30, 32, 262, 265, 268 
Hölder exponent 283, 312 
Hölder function 8, 10, 30, 161 
Hölder's inequality 297 
homeomorphism 10, 33 
homogeneous turbulence 308 
horseshoe map 194-195, 195, 196, 212 

image 7, 258, 275 
image encoding 145-148 
in general 109, 110 
independence 

of events 20 

of random variables 20 
independent increment 259, 268, 311 
index-α fractional Brownian function 273 
index-α fractional Brownian motion 267, 267-271, 269, 311-312 
index-α fractional Brownian surface 273, 274 
infimum 5 

information dimension 41 
injection 7 
integral 16-17 
integral geometry 118 
interior 6 

of loop 223 
intermittency 308 
interpolation 169, 170 
intersection formulae 110-113 
intersection(s) 4, 109-118, 110, 
265-266, 275 

large 113-118, 157 
interval 4 



interval density 152 
invariance 

geometric 41 

Lipschitz 41, 48 
invariant curves 205, 207 
invariant measure 208, 208-211 
invariant set 123, 123-129, 187, 218 
invariant tori 208 
inverse function 7 
inverse image 7 
investment calculations 186 
irregular point 78 
irregular set 78, 79-80, 94 

examples xxi, 81, 180 
isolated point, as attractor 201 
isometry 7, 30 
isotropic 261 
isotropic turbulence 307 
iterated construction(s) 95-96, 96, 

180-181 
iterated function system (IFS) 123, 
123-128 

advantages 128 

attractor for 123, 123-129, 146-148, 
194, 228 

and repellers 187-189 

variations 135-139 
iterated Venetian blind construction 

95-96, 96, 180 
iteration 186-201, 215-242 

Jarník's theorem 155-157, 205, 207 
Jordan curve 53, 81 
Julia set xxii, 215, 215-242, 219, 233, 
234 

Kakeya problem 176-179 
Koch curve see von Koch curve 
Kolmogorov-Arnold-Moser (KAM) 

theorem 208 
Kolmogorov entropy 41 
Kolmogorov model of turbulence 

307-309 

laminar flow 307 
landscapes 273 
Laplace's equation 304, 305 
law of averages 23-24 
Lebesgue density theorem 77, 77, 93 
Lebesgue measure 13, 16, 112, 192, 264 
n-dimensional 13, 17, 28, 112, 143, 
266 
Legendre spectrum 282 
Legendre transform 281, 281, 282, 287 
length 13, 81 

scaling of 29 
level set 266, 275 
level-k interval 35, 62, 152 
level-k set 127 
Lévy process 267, 271-273 
Lévy stable process 271, 271-273 
Liapounov exponents 208-211, 209, 210, 212 
limit 8-9 

lower 9 

of sequence 5 

upper 9 
lim sup sets 113 
line segment 

dimension print 56 

uniform mass distribution on 14 
line set 176, 176-179 
linear transformation 8 
Lipschitz equivalence 236 
Lipschitz function 8, 10, 30, 103 
Lipschitz invariance 41, 48, 55, 56 
Lipschitz transformation 30, 32, 34 
local dimension 283 
local product 103, 196, 202 
local structure of fractals 76-89 
logarithmic density 41 
logarithms 10 

logistic map 189-193, 191, 192, 212 
long range dependence 311 
loop 223, 224-225 

closed 201 
Lorenz attractor 203-204, 204 
Lorenz equations 202-203 
lower box(-counting) dimension 41 
lower coarse multifractal spectrum 280 
lower density 77, 84 
lower limit 9 
lubrication theory 305 



Mandelbrot, Benoit xxii, xxv 
Mandelbrot set 223, 223-227, 224, 230, 

233, 235 
mapping(s) 6, 8 
Markov partition 189 
martingale 248, 311 
mass distribution 11, 12, 277 

construction by repeated subdivision 
14-15, 75 

and distribution of digits 151-152 

and product rule 99, 101 

uniform, on line segment 14 



mass distribution principle 60, 60-61, 

131, 200 
maximum modulus theorem 231 
maximum range 160 
Maxwell's equations 310 
mean 21 

mean-value theorem 10, 137, 190 
measure(s) 11, 11-17 

counting 13 

Hausdorff 14, 27-30, 28, 53, 99, 113 

Hausdorff dimension of 209, 288 

invariant 208, 208-211 

Lebesgue 13, 16, 28, 112, 192 

multifractal 211, 277-296 

n -dimensional Lebesgue 13, 17, 112, 
143 

net 36, 68 

packing 50-53, 51, 88 

probability 19 

restriction of 14 

self-similar 278, 279, 280, 286 

on a set 11 

σ-finite 95 

tangent 89 
r-mesh cube 42, 278 
method of moments, for multifractal 

spectrum 280-281, 283 
metric dimension 41 
middle k Cantor set 64 
middle third Cantor set xvii, xviii 

as attractor 189 

box(-counting) dimension 47 

construction of xviii, 111 

features xviii, 123 

generalization of 63-64 

Hausdorff dimension 34-35, 60, 61 

in intersections 112 

product 99, 100 

and repellers 188, 189 

and self-similarity 123, 124, 129 

and tangents 87 
Minkowski content 45 
Minkowski(-Bouligand) dimension 46 
modified box(-counting) dimension 49, 
49-50 

upper, and packing dimension 51-52 
modified von Koch curve 132, 133-134 
moment sum 280, 281 
monotonicity 

of box(-)counting dimension 48 

of dimension print 55 

of Hausdorff dimension 32, 41 

of packing dimension 51 
Montel's theorem 218-219, 221 
Moser's twist theorem 207 
mountain skyline 271, 298 
multifractal(s) 211, 277-296 

coarse analysis 277, 278-283 

fine analysis 277, 283-286 

self-similar 286-296 
multifractal spectrum 277-286, 283, 

289, 292-294, 315 
multifractal time 312, 313, 314 
multifractional Brownian motion 271 
multiple points 265, 275 
multivariate normal variable(s) 267 

natural fractals xxvi, 146, 147, 298-300 
natural logarithms 10 
Navier-Stokes equations 203, 307, 309 
'nearest point mapping' 181, 182 
neighbourhood 4, 5, 45 

see also (delta-)/(δ-)neighbourhood 
net measure 36, 68 
neural networks 277 
Newton's method 237-241 
non-linear Cantor set 136-137, 154 
non-removable set 180, 181 
non-singular transformation 8 
normal distribution 23, 260 
normal family 218, 218-219 

at a point 218 
normal numbers 151 
number theory 151-158 

often 110 

one-sided dimension 54 
one-to-one correspondence 7 
one-to-one function 7 
onto function 7 
open ball 4 
open interval 4, 138 
open set 5, 5, 32, 41, 187 
open set condition 129, 130-134, 249 
orbit 186, 189, 216, 228 
orthogonal projection xxv, 34, 90-97, 91, 
176, 177 



perfect set 220, 242 

period 216 

period doubling 191-193 

period-p orbit 216, 232, 235 

period-p point 186, 191 

periodic orbit 228, 232 

periodic point 216, 221 

phase transition 251 

physical applications xxvi, 298-316 

pinch point 232 

plane cross section 202 

plant growth 300 

Poincaré-Bendixson theorem 201, 202 

Poincaré section 202 

point mass 13 

pointwise convergence 10 

Poisson's equation 307 

polynomials, Newton's method for 

237-241 
population dynamics 186, 190, 193 
porous medium, flow through 305 
(s-)potential 70 
potential theoretic methods 70-72, 92, 

111, 248-249 
power spectrum 171, 270 
Prandtl number 203 
pre-fractal(s) 126, 127 
pre-image 7 
probability 18 

conditional 19 
probability density function 23 
probability measure 18, 19 
probability space 19 
probability theory 17-24 
product 99 

Cartesian 4, 87, 99, 100 
product formula 99-107 
projection(s) 90-97, 91 

of arbitrary sets 90-93 

of arbitrary sets of integral dimension 
95-97 

of s-sets of integral dimension 93-95 
projection theorems 90-93, 180 
Pythagoras's theorem 266 



packing dimension 50-53, 51, 284 
and modified upper box dimension 
51-52 

packing measure 50-53, 51, 88 
and Hausdorff measure 53 

parallel body 4 

Parseval's theorem 73 

partial quotient 153 

percolation 251-255, 252, 253, 254 



quadratic functions 223-235 
quadratic polynomial, Newton's method 

for 238-239 
quasi-circle(s) 231, 235-237, 236 

radio antennas 309-311 
rainfall distribution 277 
random Cantor set 245, 246, 246-250, 256 
random fractal(s) 244-256 
random function 259, 260, 270 
random mid-point displacement method 

260 
random process 259 
random variable 20 

independence of 20-21 

simple 21 
random von Koch curve xxii, xxiii, 245, 

251, 256 
random walk 258-259 
range, maximum 160 
ratio of contraction 128 
rational function 238 
rectifiable curve 81, 81-84, 181-182 

tangent 85-86 
reflection 8 
regular point 78 
regular set 78, 80-89, 93-94 

examples 81 

tangent 86-87 
removable set 179, 179-181 
repeller 187, 187-189, 215 
repelling point 216, 233 

closure of 221 
reservoir level variations 160 
residence measure 208-209, 277, 283 
restriction of a measure 14 
rigid motion 8, 109, 111 
ring(s) of fractional dimension 182-184 
Rössler band 204, 204 
rotation 8 



sample function 259, 262, 265 

sample space 18 

scalar multiple 4 

scaling property 29, 29 

sections, parallel 105 

sector, double 85 

self-affine curve 166-169, 168, 312 

construction of 166, 167 
self-affine set 106-107, 107, 139, 
139-145 

as attractor 139, 140, 142, 144, 146 

construction of 107, 142 
self-similar fractal, with two different 

similarity ratios xxi 
self-similar measure 280, 286, 294 

construction of 278, 279 

multifractal spectrum 288, 293-296 
self-similar multifractals xxiv, 286-296 
self-similar set xxiv, 128-135, 128 

similarity dimension xxiv 



see also middle third Cantor set; 

Sierpiński gasket; von Koch curve 
self-similarity xxii 
sensitive dependence on initial conditions 

187, 194, 222 
s-set 32, 69, 76 

0-set 77 

1-set 80-84, 86-87, 93-94 

tangent to 84-88, 85 
set theory 3-6 

share prices 160, 258, 277, 311-315 
Siegel disc 232 
Sierpiński dipole 310-311 
Sierpiński gasket or triangle xx, xx, 129, 132, 256 
(sigma-)/σ-finite measure 95 